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The Journal of Consulting Psychology will 
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 


1. Sends the Brief Report, limited to one printed 
page and prepared according to the specification 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 
Documentation Institute to insure indefinite avail- 
ability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
charge to all who request it as long as the supply 
lasts. 


4. Agrees not to submit the full report to another 
journal of general circulation. 


Specifications 

Brief Report. The Brief Report should give 
a Clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits. 

To insure that the Brief Report will be no 
longer than one printed page, its typescript, 
including all matter except the title and the 


Brief Reports 


author’s lines, must not exceed 85 lines av- 
eraging 42 characters and spaces in length. 
Set the typewriter margins for short lines of 
42 characters, which are 3.5 inches long in 
elite typing, and 4.2 inches long in pica. 

The manuscript of the Brief Report must 
be double spaced throughout. Except for its 
short lines, it follows the standard style of 
the 1957 revision of the APA Publication 
Manual. Headings, tables, and references are 
avoided or, if essential, must be counted in 
the 85 lines. Each Brief Report must be ac- 
companied by a footnote in the style below, 
which is typed on a separate sheet and not 
counted in the 85-line quota: 


1 An extended report of this study may be ob- 
tained without charge from John Doe (giving the 
author’s full name and address) or for a fee from 
the American Documentation Institute. Order Docu- 
ment No. —— from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress; Washington 25, D. C., remitting in advance 
$— for microfilm or $— for photocopies. Make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 


Extended report. Because the extended re- 
port is intended for photoduplication, and is 
not copy to be sent to a printer, its style 
should differ in several ways from that of 
other manuscripts: (2) The extended report 
should be typed with single spacing for 
economy in duplication. (6) Tables and fig- 
ures should be placed adjacent to the text 
which refers to them. A caption should be 
typed below each figure. (c) Footnotes should 
be typed at the bottom of the page on which 
reference is made to them. In other respects, 
the full report is prepared in the style speci- 
fied by the Publication Manual. 
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CLIENT DEPENDENCY AND THERAPIST EXPECTANCY 
AS RELATIONSHIP MAINTAINING VARIABLES 
IN PSYCHOTHERAPY 


KENNETH HELLER 


University of North Dakota 


The relationship and interactional aspects 
of psychotherapy, while long considered im- 
portant, have received ever-increasing theo- 
retical and experimental emphasis in recent 
years. Bordin (1959), for example, has stated: 


The key to the influence of psychotherapy on the 
patient is in his relationship with the therapist. 
Wherever psychotherapy is accepted as a significant 
enterprise, this statement is so widely subscribed 
to as to become trite. Virtually all efforts to theorize 
about psychotherapy are intended to describe and 
explain what attributes of the interactions between 
the therapist and the patient will account for what- 
ever behavior change results (p. 235). 


Previous investigations of the psychothera- 
peutic relationship have frequently been at- 
tempts to identify and interrelate client, 
therapist, and/or transactional variables 
which theoretically appear to be important 
dimensions of the therapist-client interaction. 
The present study, continuing in this direc- 
_ tion, has focused upon client-therapist at- 
traction as its major concern. This potentially 
significant dimension of the therapeutic rela- 
tionship has been defined by Libo (1957), in 
a general sense, as “the resultant of all forces 
acting on the patient to maintain his relation- 
ship with the therapist.” To implement this 
definition, he developed the Picture Impres- 
sions Test, a projective technique designed to 
call forth client verbalizations concerning 
feelings toward therapists and the therapy 
process. With client-therapist attraction de- 
fined in this manner, Libo (1957) demon- 
strated a significant relation between the 
magnitude of client attraction toward the 
therapist and certain clearly observable and 
therapy relevant, overt, client behaviors. 

There is evidence to suggest that the spe- 
cific nature of these relationship maintaining 
forces acting upon the client may include such 
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participant characteristics as client depend- 
ency and the therapist’s expectations regard- 
ing a favorable therapeutic outcome. In a 
discussion of dependency in psychotherapy, 
Dollard and Miller (1950) note that therapy 
is often facilitated by initial client depend- 
ency. According to their formulation, the 
client brings to the therapeutic situation a 
desire to please the therapist, this desire being 
considered one of the main forces helping the 
client overcome the initial anxieties associated 
with therapy. As therapy progresses the client 
is expected to grow in independence, since he 
need no longer rely on pleasing the therapist 
as his only motivation for continuing in 
therapy. 

The viewpoint that dependent clients be- 
come more independent after the successful 
completion of psychotherapy is also examined 
by another line of investigation. Studying the 
present-self and ideal-self descriptions of psy- 
chiatric patients, Fordyce (1953) found that 
those patients who described themselves as 
dependent stated that they would ideally like 
to see themselves as being more independent. 
Since Rogers and Dymond (1954) and others 
report that successful psychotherapy produces 
an increased congruence between present-self 
and ideal-self descriptions, it seems reasonable 
to expect that, after a course of successful 
psychotherapy, clients with pretherapy de- 
pendent self-descriptions should see them- 
selves as growing in independence. 

The therapist’s expectation of patient im- 
provement, a second potentially important 
relationship maintaining variable, has been 
demonstrated by Goldstein (1960b) to affect 
significantly the amount of improvement the 
patient reports as having taken place and 
also the duration of psychotherapy. Kelley 
(1949), Rosenthal (1959), and Ulenhuth, 
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Canter, Neustadt, and Payson (1959) have 
also demonstrated the potency of participant 
expectancies in two-person interactions. 


Hypotheses 


1. Client pretherapy attraction to the psy- 
chotherapist varies: (a) positively with client 
pretherapy dependency, and (6) positively 
with client over-therapy movement toward 
independence. 

2. Client pretherapy attraction to the psy- 
chotherapist varies positively with the latter’s 
expectation of client improvement. 


METHOD 


Two treatment conditions were utilized in the 
present investigation: therapy (experimental group), 
and no-therapy (control group). Thirty clients and 
10 therapists participated. Most of the clients were 
undergraduates in attendance at the Pennsylvania 
State University who had sought psychotherapy 
at the University Psychological Clinic. Clients were 
randomly assigned to the two treatment conditions 
and the 15 clients in the experimental group were 
then randomly assigned to therapists. Each therapist 
met with his client(s) two times per week for indi- 
vidual, 50-minute sessions. The 15 control clients 
were placed on a waiting list and did not participate 
in formal psychotherapy during the 15 session dura- 
tion of the investigation. 

The 10 therapists employed in this study had all 
completed their formal predoctoral training in psy- 
chotherapy, including an approved internship. Eight 
therapists had yet to fulfill all the requirements for 
the PhD degree, while two had already received 
their PhD degrees. The therapists, all males, ranged 
in age from 24 to 37, with a median age of 32. 
When asked to describe their own orientation to 
therapy, none of the therapists expressed a strong 
preference for any particular “school” of psycho- 
therapy. All stated that their approach would vary 
according to the client and the situation. The 
therapists were employed by either the Psychological 
Clinic or the Division of Counseling at Pennsylvania 
State University and usually saw clients as part of 
their regular clinical case load. 


Measurement of Variables 


Client-therapist attraction. As suggested above, 
client-therapist attraction was operationally defined 
as the client’s score on the Picture Impressions Test. 
This projective technique consists of four cards 
depicting therapy-like situations to which the client 
is requested to respond in a manner analogous to 
TAT administration. Content analysis scoring (Libo, 
1956) (eg., Locomotion, Barriers to Locomotion, 
Satisfaction, etc.) was carried out independently by 
the authors with complete agreement occurring on 
83% of the client stories. For each client, an at- 
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traction score was then determined by summing his 
scores for each of his four stories. 


Dependent behavior. Dependent behavior was 
conceived of as the extent to which an individual 
prefers to have others prevent his frustration or 
punishment and provide need satisfaction (Fitz- 
gerald, 1958). In order to narrow the definition of 
dependency even further, it was decided to concen- 
trate on two aspects of dependent behavior described 
by Murray (1943) as Succorance and Deference. 
Measurement of dependent behavior occurred at two 
levels: 

Seif-descriptive dependency. To measure the extent 
to which clients attributed dependent behavior to 
themselves, the Succorance, Deference, and Auton- 
omy scales of the Edwards Personal Preference 
Schedule (EPPS) (Edwards, 1954) were adminis- 
tered. Following the suggestion of several researchers 
(Bernardin & Jessor, 1957; Gisvold, 1958; Zucker- 
man & Grosz, 1958) a total self-descriptive depend- 
ency score was computed by summing the scores 
from the Succorance and Deference scales and sub- 
tracting from the sum the score from the Autonomy 
scale. 

Overt dependency. A Situational Test of Depend- 
ency developed earlier by one of the authors 
(Heller, 1959) from a modification by Borgatta 
(1951) of the Rosenzweig P-F study (1947) was 
used in the current investigation. Borgatta developed 
a role-playing form of the P-F study in which the 
original paper and pencil situations were acted out 
by both examiner and examinee. Borgatta’s evidence 
suggests that subjects react to the role-playing form 
of this test in a manner quite similar to the way 
in which they react to real, overt, threatening situa- 
tions. The role-playing situations were further modi- 
fied so that all the situations involved a degree of 
threat to the respondent. An additional modification 
was development of a forced-choice rather than an 
open-ended method of responding. 

Therapist expectation. Therapist expectation of 
client personality change was generally defined as 
the feelings held by the therapist relating to the 
anticipated nature and intensity of his client’s per- 
sonality problems upon completion of the latter’s 
psychotherapy. Operationally, this variable was de- 
fined (Goldstein, 1960b) as the difference between 
the therapist’s ordering of personality problem Q sorts 
when he is instructed to sort them under two dif- 
ferent orientations: (a) according to the status in 
which he, the therapist, expects his client’s problems 
to be upon completion of psychotherapy; and (b) 
according to the manner in which he views his 
client’s problems at the time of sorting, i., his 
present perception of his client. 

The Picture Impressions, EPPS, and the Situa- 
tional Test of Dependency were individually admin- 
istered to all clients immediately prior to their first 
therapy session and immediately following their 
fifteenth session. If a therapy client dropped out of 
therapy before his fifteenth session, he was “post- 
tested” at the time of dropout. When this occurred, 
a control client who had been in the wait group for 
the same period of time as the experimental client 
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had been in therapy, was also tested. The therapists 
completed their sortings after every 5 sessions for 
15 sessions. 


RESULTS AND DiscussION 

The correlations, for both experimental and 
control clients, between pretherapy attraction 
and the dependency scores obtained pre- and 
posttherapy, as well as the resultant depend- 
ency difference score, are presented in Table 1. 

The pretherapy correlations in Table 1 
indicate a significant relationship between 
client’s attraction and both self-descriptive 
and overt dependency. Those individuals who 
wrote stories to the Picture Impression cards 
indicating that they anticipated positive 
gratification from therapy, described them- 
selves before therapy as more dependent ac- 
cording to the EPPS, and also acted more 
dependently on the Situational Test of De- 
pendency. This finding lends support to the 
contention of Dollard and Miller that initial 
client dependence can act in ways that main- 
tain the early stages of the psychotherapeutic 
relationship. 

The hypothesized relationship between posi- 
tive attraction to therapy and movement 
toward self-descriptive independence over 
therapy is also supported. Those clients who 
are positively attracted see themselves as be- 
coming more independent as therapy pro- 
gresses. Of additional interest is the fact that 
a distinction can be made between self- 
descriptive and behavioral changes toward 
independence. While the attracted clients saw 
themselves as becoming more independent, 
this relationship was not found on the overt 
behavioral measure. Behaviorally, the at- 
tracted clients in the experimental group were 
still dependent at the time of the posttherapy 
testing, ie., after 15 sessions of psycho- 
therapy. It appears that attracted clients may 
be set to see themselves as changing, although 
their interpersonal interactions remain rela- 
tively constant. The motivation for this 
change in self-descriptive dependency may 
well be the desire to please the therapist and 
thus say what they think the therapist expects 
or would like them to say. Working with a 
sample of psychotherapists who received their 
training at the same university as the thera- 
pists of the present study, Peterson, Snyder, 
Guthrie, and Ray (1958) have demonstrated 
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TABLE 1 


CORRELATIONS BETWEEN CLIENT PRETHERAPY 
ATTRACTION AND DEPENDENCY 


Group 
Pretherapy Attraction All 
with: Subjects 


Experimental Control 


Pretherapy 


EPPS dependency 
situational dependency 


*** 
491 *** 


Pre-post difference 


EPPS dependency 
situational dependency 


Posttherapy 
EPPS dependency 
situational dependency 


* Data obtained at this time preceded the assignment of 
subjects to experimental and control groups, hence prether- 
apy r's were calculated across ali 30 subjects. 

* Significant at .06 level. 

** Significant at .05 level. 
*** Significant at .01 level. 


that therapists of this training background 
show a great deal of attention to their clients 
when the latters’ remarks demonstrate a pref- 
erence to let others provide for the satisfac- 
tion of their needs. Should this differential 
between self-descriptive and overt behavioral 
test findings be corroborated in further re- 
search, serious doubt would be cast upon the 
common procedure of evaluating the effective- 
ness of psychotherapy by the exclusive use of 
self-descriptive measures. 

Somewhat more puzzling is that the control 
clients, who received no formal psycho- 
therapy, showed almost the same relation be- 
tween attraction and self-descriptive move- 
ment toward independence. It should be noted 
that the control group as a whole showed no 
movement over therapy, either in the inde- 
pendent or the dependent direction. But still, 
when individual variation is considered, those 
in the control group who described themselves 
as becoming more independent over time were 
positively attracted toward therapy, while 
those who described themselves as becoming 
dependent tended to be those who were nega- 
tively attracted. The investigators can only 
speculate concerning the reasons for this rela- 
tionship in the control group. Our present 
inclination is to view attracted clients as indi- 
viduals who would interpret even minimal 
clinic contact (such as is involved in testing 
sessions) as benefiting them in some way. 


= 
J 
— —.774*** —.508* 
065 —.009 
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TABLE 2 


CORRELATIONS BETWEEN CLIENT ATTRACTION AND 
THERAPIST EXPECTANCY 


Variables correlated r 


Preattraction and: 


TE;* 427 

.144 

TE,s° .199 
Difference-attraction and: 

TEs .619* 

TEw —.137 

TEs A418 
Postattraction and: 

TEs .535* 

.162 

.096 


* Fifth session therapist expectancy. 

» Tenth session therapist expectancy. 

¢ Fifteenth session therapist expectancy. 
* Significant at .05 level. 


A study by Barron and Leary (1955) ap- 
pears to offer support for this contention. 
They state, with regard to wait-list control 
clients: 
simply having committed oneself to participating 
in psychotherapy, and having had a reciprocal com- 
mitment from a clinic to afford psychotherapy, even 
though not immediately, represents a breaking of 
the neurotic circle. A force for change has already 
been introduced. In addition, the initial interview 
and the psychological testing may themselves be 
psychotherapeutic events, since during such sessions 
the patient makes some efforts to confront himself 
and his problems more objectively than he has in 
the past (p. 244). 


A recent investigation by Goldstein (1960a) 
supports this finding. 

Table 2 presents the correlations between 
therapist expectancy of client improvement, 
as obtained at five session intervals, and client 
pre- and posttherapy attraction, as well as the 
change in attraction over the course of 
therapy. 

These findings indicate a significant rela- 
tion between the expectation of client im- 
provement held by the therapist early in 
therapy, and both the change in client at- 
traction over the course of therapy and the 
magnitude of client attraction subsequent to 
the fifteenth session. None of the other cor- 
relations presented in Table 2 reached ac- 
cepted levels of significance. 
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In addition to offering partial support for 
the hypothesis that therapist expectancy is a 
relationship maintaining aspect of the psycho- 
therapeutic interaction, these findings raise 
the question as to why this should only be the 
case with regard to the therapist’s early 
expectations (fifth session), and not his tenth 
and fifteenth session anticipations of client 
improvement. A study by Good (1952) fur- 
nishes a basis for differentiating “early” and 
“late” therapist expectations, a differentiation 
which appears to shed light on the present 
study’s findings. He states: 


Support was found for the hypothesis that in a 
relatively novel situation, generalization effects 
chiefly determine the expectancy held by S, and that 
as S has more experience with the specific task, 
expectancies develop which are a function of this 
task (p. 99). 


In the present study, the expectations held 
by the therapist at the fifth session regarding 
client personality change may be more a 
function of his perceived success and failure 
with past clients than his feelings concerning 
his present client’s progress. By the tenth 
session, however, their psychotherapeutic in- 
teraction is less “novel” and the major 
determiner of the therapists’ expectations may 
have shifted from generalization effects to task 
effects. Kelly (1955), Lennard and Bernstein 
(1960), and Rotter (1954) have also noted 
significant temporal shifts in therapist ex- 
pectancies over the course of psychotherapy. 

The basis for the failure of late-therapy 
therapist expectations to relationship 
maintaining would appear to be an important 
question for further research. 


SUMMARY 


The present investigation attempted to 
determine the extent to which client depend- 
ency and therapist expectation of client im- 
provement can be considered relationship 
maintaining variables in the psychothera- 
peutic interaction. Thirty clients undergoing 
psychotherapy at a University Psychological 
Clinic were randomly assigned to “therapy” 
and “no-therapy” conditions and the 15 
“therapy” clients were randomly assigned to 
10 therapists. The therapists, for the most 
part, were advanced graduate students in clin- 
ical psychology at the Pennsylvania State 
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University. Testing, on measures developed or 
modified for the current study, took place pre- 
and posttherapy for all clients and after every 
five sessions for the therapists. Results of the 
study indicated a strong positive relation be- 
tween client pretherapy attraction to the 
therapist and: (a) both client self-descriptive 
and behavioral dependency before therapy 
and, () client self-descriptive, but not be- 
havioral, movement toward independence over 
the course of therapy. A similar but less 
marked relationship occurred in control group 
clients, offering further evidence for the 
therapeutic nature of such nonspecific clinic 
contacts as the intake interview and psycho- 
logical testing. 

An unexpected finding was the relatively 
high degree of relation between pretherapy 
attraction in therapy clients and their overt 
posttherapy dependency, a finding at variance 
with that obtained on self-descriptive instru- 
ments. In addition to other implications, this 
finding suggests caution in interpreting re- 
sults in psychotherapy research which are 
based solely on one level of measurement. 

Finally, partial support was obtained for 
the hypothesis that favorable therapist ex- 
pectation of client improvement can function 


to maintain the therapeutic relationship. 
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ON SELF-CONSISTENCY: 


A REPLICATION AND EXTENSION ' 


ROSALIND DYMOND CARTWRIGHT 2 


University of Colorado 


In a previous paper (Cartwright, 1957), 
it was proposed that one of the common 
themes of psychotherapy with psychoneurotic 
patients is a search for a stable identity. In 
that paper the author put the problem in 
these terms: 


If selves or roles are thought of as characteristics 
which are particular to specific interactions or 
classes of interaction, then the area of self is that 
core of characteristics which is common over N 
situations. In these terms the pre-therapy client can 
be thought of as one whose self is very small. He 
seems to have diverse selves in relation to others 
with whom he interacts, but the common core, the 
essential me is so restricted in scope as to leave him 
puzzled by the question “Who am I?” 


It was hypothesized that the self the pre- 
therapy client describes would differ con- 


siderably depending on his _ interactional 
referent, while the posttherapy client’s self- 
description would have more stability in 
terms of a larger core of characteristics which 
remain consistent in emphasis, regardless of 
the particular other involved. Of course it is 
most likely that the healthy individual will 
retain some variability, that there is an 
optimal point here short of rigidity. 
Specifically, the 1957 study hypothesized 
that, if asked to select three people of major 
importance to them and to describe them- 
selves on a Q sort as they are with each of 
these people in turn, a group of clients would 
show more variability among these sortings 


1The data for this study were collected under a 
research grant from the Ford Foundation to the 
University of Chicago Counseling Center. 

2 The author wishes to thank John L. Vogel, now 
of the Department of Psychiatry, University of 
Washington, Seattle, who did all of the testing of 
the subjects. 


before therapy began than after therapy had 
been completed; successful clients would 
change more than failure clients, and clients 
before therapy would have more variability 
in their self-descriptions than a group of con- 
trol subjects, but would not differ from 
controls after therapy had been concluded. 

Although the 1957 study supported the 
hypothesis outlined above, there were several 
reasons why it was felt that the study should 
be replicated and extended. (a) The sample 
was a very small one: 10 experimental sub- 
jects (5 successful and 5 unsuccessful cases) 
and 10 controls. (6) The controls were only 
tested at one point in time, as it was as- 
sumed that they would not change in self- 
consistency. This should be experimentally 
established. (c) The relations among these 
various self-to-other sortings were not tested. 
For these reasons the present study was 
undertaken. 


METHOD 
Sample 


The present sample consists of 19 experimental 
subjects * and 20 controls. Originally 30 subjects who 
applied to the University of Chicago Counseling 
Center for psychotherapy were asked to participate 
in this research study. Of these, eight discontinued 
therapy before completing the minimum number of 
interviews, which was set at six. An additional three 
cases of those completing therapy were lost by 
failure to make contact with them for posttherapy 
testing. 

The control sample was selected to match the 
experimental sample in age, sex distribution, and 
student and nonstudent status. As they were to be 
controls for the variable therapy, they were selected 
as never having had, not now having, and having 

3 This sample is identical to that reported on in 
the paper by Cartwright and Vogel (1960). 
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no immediate intention of having psychotherapy. 
Table 1 compares the samples in the two studies. 


Procedure 


The procedure differed slightly from that used 
in the previous study in that the subjects were first 
asked to complete a “plain” self-sort. The 100 item 
Butler and Haigh Q cards (1954) were again used 
as the primary instrument. Each subject was first 
instructed to sort the cards to describe himself as 
he is today from those items which are most like 
him to those which are least like him according to 
the required 1, 4, 11, 21, 26, 21, 11, 4, 1 distribu- 
tion. Next a TAT was administered. Following this 
the procedure was identical to that previously re- 
ported. The instructions to the subjects were: 


I would like you to think of three people who are 
very important to you. They can be of any age 
or relation to you like father, mother, child, 
friend, or boss. They can be people you like or 
dislike, or some of each. You don’t have to pick 
on any basis except they be people who are very 
important to you and with whom you have real 
interaction. 


The subject was then given a code sheet on which 
to enter code names for his choices opposite their 
role relation to him. He was then given a deck of 
Butler and Haigh Q cards and instructed to sort 
them to describe himself as he is in his relationship 
to the first person on his list. 


Sort these cards to describe yourself as you 
see yourself to be in your relationship to 
, from those cards which are least like 
you as you are with him (her) to those that are 
most like you in your relations with him (her). 
Keep this question in mind while you are work- 
ing: If I were only the person I am _ with 

, what would I be like? 


Following this sorting, the subject was given 
another set of the same cards and asked to sort 
again to describe himself as he is with the second 
person on his list. Then this was repeated a third 
time for the third person of major importance to 
him. All in all then, there were four sortings of the 


TABLE 1 


COMPARISON OF THE EXPERIMENTAL AND CONTROL 
SAMPLES IN THE 1957 AND 1959 Stupy 


Number of 
Age interviews 
Mean Range 


Sample Mean Range 


E 1957 
C 1957 
E 1959 
C 1959 


18-42 & 65 
19 44 
is-41 : 8-97 
19-53 


TABLE 2 


COMPARISON OF THE MEAN ITEM VARIANCES FOR THE 
1957 anp 1959 StrupiEes 


Sample 


Pretherapy 


Posttherapy 
E 1957 .969 .766** 
E 1959 .939 .763* 
C 1957 .736 Not tested 
C 1959 .763 or 
S 1957 849 
S 1959 940 580*** 
F 1957 1.088 .936 
F 1959 .938 .928 


Significant at .05 level. 

** Significant at .02 level. 
*** Significant at .01 leve:. 
**** Significant at .001 level. 


same Q items using the same distribution: one self- 
sort and three of his conception of himself in three 
different relationships. 

For the experimental subjects these procedures 
were repeated when they had completed their 
therapy. For 10 of the control subjects they were 
repeated after a period of 6 months (to match the 
mean of 18 weeks of therapy). 


ANALYSIS AND RESULTS 


To test the major hypothesis of increased 
self-consistency following psychotherapy, the 
same method reported previously was em- 
ployed. Briefly the mean item variance over 
the three self-in-relationship sortings was 
computed by a method developed by Cart- 
wright (1956a). The hypothesized differences 
between these means were then tested using ¢. 
The ¢ between pre- and posttherapy mean 
for the experimental (E) group was 2.100, 
significant at the .05 level.* 

The ¢ between the first and second testing 
of the control (C) group was 4.394, signifi- 
cant at the .001 level. 

From these results it would appear that 
a second testing of either experimental 
(therapy) or control (no therapy) subjects 
will show a significant increase in consistency 
of sorting. 

In the 1957 study, five successful therapy 
cases were found to decrease their mean 
item variance significantly while five failure 
cases did not. Using the same criterion of 
successful therapy for the present sample 


4 All significance levels are for two-tailed tests 
unless otherwise stated. 
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TABLE 3 


Mean Q ApbjusTMENT Scores OF SELF-SorT OF 
CONTROL, SuCCESS, AND FAILURE GROUPS 


Direction of 
change in ad- 
justment and 

self-consistency 


Q Adjustment 
score 


Different 


Group Test 1 Same 


Test 2 


49.3 
36.7 
36.1 


50.5 
49.4**** 
38.3 


**** Significant at .001 level. 


Control 
Success 
Failure 


yielded only 3 cases of success out of the 19. 
This criterion (based on therapists’ ratings 
of adjustment, change in adjustment, and 
success of the therapy) was, therefore, re- 
luctantly abandoned for one that would split 
the group more equally. The new criterion 
was based on change in test performance 
from pre- and posttherapy time. The 19 ex- 
perimental subjects were ranked on the ex- 
tent of their improvement on two test scores: 
the Q Adjustment score (Dymond, 1954a) 
and a mental health rating of the TAT 
(Dymond, 1954b). The 9 subjects who im- 
proved most on this combined ranking of 
change were called the success (S) group and 
the 10 who ranked lowest were called failures 
(F). Table 2 compares the mean item vari- 
ances for the groups and subgroups involved 
in the original and in this replication study. 
It is clear from Table 2 that failures as a 
group do not increase in self-consistency on 
a second testing. 

While the extent of the replication of the 
mean scores in the two studies is striking, 
particularly since such small groups were in- 
volved, the meaning of the change for the 
therapy subjects is obscured by the highly 
significant change in the group which was not 
in treatment. Testing for the significance of 
the difference between the changes in the S 
group and the C group yielded a nonsignifi- 
cant ¢ (1.644). However, inspection of the 
array of change scores showed that the range 
of changes was considerably greater for the 
S group than for the Cs. An F ratio was 
computed and found to be 3.30, significant 
at the .05 level. A comparison of the disper- 
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sion of the changes between the total experi- 
mental group and the C group showed the 
E group’s dispersion to be much wider, of 
course. The F ratio here was 13.19, signifi- 
cant beyond the .001 level. These findings 
of the significant difference in the variances 
of changes between E and C and between S 
subgroup and C is similar to the finding 
reported by Cartwright (1956b) on his re- 
analysis of the data reported by Barron and 
Leary (1955). He concluded that clearly 
there was more change with regular formally 
defined therapy than without it, but in both 
directions. Similarly, in this study it can be 
argued that although both E and C groups 
changed significantly in the direction of in- 
creased self-consistency over similar time 
periods, those in therapy showed wider 
changes. It seems that therapy has more 
impact to change persons on this measure 
than the unknown influences which brought 
about the change in the no-therapy group. 

Further light is shed on the meaning of 
these changes by looking at the Q Adijust- 
ment scores of the plain self-sort. If the 
change toward greater consistency is to be 
evaluated positively, the more stable self 
should be one the subject can live with more 
happily. Therefore it should be accompanied 
by a higher Adjustment score of the self- 
description. This is not a necessary relation- 
ship. It is possible for the self-in-relationship 
to other sortings to become less varied with- 
out an accompanying change towards better 
adjustment. This is what appears to happen 
in the control cases. ' 

Several important points emerge from an 
inspection of Table 3. 

1. The C group does not improve in ad- 
justment despite the fact that it does in- 
crease in self-consistency. The improvement 
in the Adjustment score of the E group as 
a whole is significant beyond the .01 level; 
in the S subgroup at the .001 level. 

2. In only 5 of the 10 C cases do the two 
measures move in the same direction, whereas 
the ratio is 15 of 19 for the E group, and 
9 out of 9 for the S subgroup. It seems likely 
then that the increase in self-consistency for 
the C cases must be interpreted differently, 
perhaps due to less motivation for making 
fine distinctions on the second test. The in- 
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structions certainly set the subjects to look 
for differences in their ways of viewing them- 
selves in their interactions. If the subject is 
largely consistent in his self-picture over 
various interactional settings, the instructions 
might cue him to make small distinctions 
which are relatively unimportant to him, and 
so not maintained on Test 2. If this explana- 
tion is tenable, it would be expected that the 
C group would differ from the S group in 
various ways. (a) They would have fewer 
items with large discrepancies on Test 1 than 
S subjects, and more small discrepancy items. 
(6) They would show less proportional change 
in their large discrepancy items than the S 
group. (c) The increased consistency of the 
C subjects would not represent any major 
change in self-definition. Items which are 
consistent on Test 2 which were not previ- 
ously consistent will have changed less in 
their position of importance to the C subjects 
than to the S cases. 

To test these notions, the extent of the 
discrepancy over the three sortings for each 
item was tabulated. Items with a discrepancy 
of 0-2 Q points were called Small Difference 
items. Items with discrepancies of 3 or more 
Q points were called Big Difference items. As 
these predictions were all directional in na- 
ture, one-tailed tests were used. All were 
confirmed at the .05 level or better. 

Looking at the new consistent items for 
the two groups, it was found that the S group 
drew their new self items from a different 
source than did the C group. If items sorted 
at the extreme scale positions (0,1,2 = least 
like me; and 6,7,8 = most like me) are as- 
sumed to be more important to the person 
than those sorted at the middle of the forced 
normal distribution, an item which was not 
previously an important self item (extreme 
in placement and consistent in position) 
could have been previously extreme but in- 
consistent or previously central. Table 4 
shows the previous position of the new ex- 
treme consistent items for the S and C 
groups. 

Without doubt the new stable elements of the 
self which are of importance to the successful 
therapy people are significantly more often 
items which have changed with respect to 
their importance. It is more often the case 


TABLE 4 


Previous Position oF New Extreme 
CONSISTENT ITEMS 


Success group Control group 


Extreme 


Extreme 


Central 


Central 


UN N 


Central-Extreme 
Control: Central-Extreme 
Central: Success-Control 
Extreme: Success-Control 


Note.—Success: 


*** Significant at .01 level or beyond. 


that these items have gained an importance 
over the therapy period, rather than that 
they were important previously and have only 
gained in stability or consistency. For the 
controls, the reverse is true. The new stable 
items at the extremes of the sorts on Test 2 
were significantly more often also at the ex- 
tremes on Test 1, although at that time in- 
consistently sorted. Here, then, is another 
indication that the increased consistency of 
these two groups represents different proc- 
esses. The successful therapy cases have 
changed their definitions of themselves in 
their important relationships, while the con- 
trols seem only to have consolidated the 
definitions previously made. 

If this proposed explanation holds, that 
the increased consistency of the two groups 
is essentially different, that therapy has an 
impact which shifts the meaning of the self 
as well as exercising a consolidating influ- 
ence, then the particular selves that the S 
group describes should reflect the shift in 
meaning in their Adjustment scores in their 
relation to others. If the C group’s con- 
sistency is due only to a lack of motivation 
for making fine distinctions, then no essential 
change of the adjustment of self in relation 
to others would be expected. Since each 
subject was free to choose the relationships 
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TABLE 5 


ADJUSTMENT SCORES FOR SELF-IN-RELATIONSHIP TO Moruer, FATHER, AND 
OTHERS OF SUCCESS, FAILURE, AND CONTROL Groups 


Father Other 
Group Test 1 Test 2 Test 1 Test 2 Test vii Test 2 
Success 38.3 49.8 46.0 53.2 44.4 a2 
Failure 32.2 37.2 39.4 36.2 43.3 34.8 
Control 52.7 54.8 42.3 45.5 46.9 47.9 


Note.—Success: Mother Testi-Mother Test 2 


4.063 *** 


t= 
Others Test 1-Others Test 2 t = 3.660*** 
Failure: Others Test 1-Others Test 2 t = 2.260* 
Mother Test 1-Others Test 1 t = 2.206* 
Control: Mother Test 1-Father Test 1 t = 2.369* 
Mother Test 2—Father Test 2 = 1.917 
Mother: Success Test 1-Control Test 1 $ = 3.237*** 
Failure Test 1-Control Test 1 t = 4.053*** 
Success Test 2-Failure Test 2 4 = 1.930 
Failure Test 2—Control Test 2 ¢ = 3.154*** 
Father: Success Test 2—Failure Test 2 $ = 3.157*** 
Others: Failure Test 2—Control Test 2 t = 2.689** 
Success Test 2-Failure Test 2 t = 2.409* 


* Significant at .05 level. 
** Significant at .02 level. 
*** Significant at .01 level. 


of importance to him, it is hard to make 
strict comparisons. For this reason relation- 
ships were only categorized as father, mother, 
and others. Table 5 gives the mean Adjust- 
ment scores for these categories at the two 
testing points. 

Some of the significant relations were: 
(a) S cases improve in the adjustment of 
the relationship to their mothers between 
Test 1 and Test 2. Neither F cases nor C 
cases show any significant change. (6) Both 
E groups are more poorly adjusted in their 
relationship to their mothers on Test 1 than 
C cases. S cases are not significantly different 
after therapy from the adjustment level of 
the C group on their second test. S cases 
border on being significantly higher in ad- 
justment on their second test than F cases 
who are still significantly poorer in mean 
adjustment than Cs. (c) There is no sig- 
nificant difference between the adjustment 
level of self-in-relationship to mother and 
father for either E group at either time. 
However, the self-in-relationship to father of 
the C group is significantly lower than their 
self-in-relationship to mother on Test 1, and 
this borders on being a significant relation- 
ship on Test 2 as well. (d) None of the 
groups changes significantly in the adjust- 
ment of the self-in-relationship to father. The 
F group is lowest of the three in mean ad- 


justment on Test 1, and this drops on Test 2 
to be significantly lower than the S group. 
(e) S cases improve their mean adjustment 
score in relation to others significantly over 
the therapy time, whereas F cases change 
significantly towards poorer adjustment. On 
Test 2 Fs are significantly lower in adjust- 
ment to others than both Ss and Cs. 

It would appear from these results that 
patients, as distinct from controls, come to 
therapy with disturbance in their self-to- 
mother relationship. Patients who  subse- 
quently do not succeed in therapy start out 
with poor adjustment of their self-image-in- 
relation to both parents while those who 
succeed start with strength from their self-to- 
father image. 

Since it seemed likely that some interesting 
conclusions could be drawn from these results 
concerning the kinds of problem relationship 
that persons bring to therapy and those which 
do and do not get resolved with therapy of 
this kind, it seemed important to check these 
data as far as possible with those from the 
previous study. This is difficult since a dif- 
ferent criterion was employed and the total 
number of cases was only 10. For this latter 
reason, the comparison was made for the 
total E group. 

The first study shows some of the same 
patterns as found in the present sample: 
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TABLE 6 
COMPARISON OF MEAN ApJuSTMENT ScorES FOR 1957 
AND 1959 StTuDIES 


Mother Father Other 


Sample Testi Test 2 ‘es Test 2 Test 1 


E 1957 31.4 ‘ 30. 41.0 
E 1959 35.0 
C 1957 46.6 38. 

52.7 45.5 


46.0 
C 1959 


(a) The mean adjustment of the C group’s 
self-in-relation to father is again lower than 
their mean adjustment of self-in-relation to 
mother, although this does not reach sta- 
tistical significance. (6) Patients in the 1957 
study were also lower in mean adjustment 
of self-in-relation to mother than were con- 
trols. Speaking loosely then, both studies 
show that clients come to therapy with 
mother problems which are not present in 
control subjects. Control subjects may have 
father problems, but these do not seem to 
be sufficiently disturbing to bring them to 
therapy. The good adjustment of the self-in- 
relation to the mother seems to be the dis- 
tinguishing mark between patients and those 
who are not patients. However, it was 
thought that perhaps the sex of the subjects 
might be an important variable in these 
relationships. Table 7 gives the mean adjust- 
ment scores for the two sexes in the current 
study. 

Obviously from Table 7 it is clear that 
clients of both sexes have problems of ad- 


justment of the self-in-relation to their 
mothers, and this is not true for either sex 
group of control subjects. Also, it is obvious 
that of the clients, it is the males that have 
father problems that bring them to therapy. 
Further, therapy of this kind improves the 
adjustment of the self-in-relationship to the 
mother for both sexes, but not significantly 
for males, and there is little change in the 
father relationship for either sex. 


SUMMARY AND DISCUSSION 


This paper has reported a study designed 
to replicate and extend a previous study 
of the effects of psychotherapy on self- 
consistency. In both experiments the samples 
consisted of psychoneurotic subjects who had 
applied to the University of Chicago Counsel- 
ing Center for client centered psychotherapy. 
Both studies employed matched control sub- 
jects who were not motivated for therapy. 
The original study found the therapy group 
to have lower self-consistency than controls 
before therapy, and to have increased their 
consistency at the completion of therapy to 
the level of the control group. 

The replication study confirmed the find- 
ings of the first, but found that controls also 
increase their self-consistency on Test 2 sig- 
nificantly over their Test 1 level. An analy- 
sis of the kinds of change involved showed 
different processes at work. The increased 
consistency of the controls did not represent 
much redefinition of the self-in-relationship 


TABLE 7 


COMPARISON OF MEAN ADJUSTMENT SCORES OF MALES AND FEMALES 


Mother 


Sample Test 1 Test 2 


Father Other 


Test 1 Test 2 


Test 1 Test 2 


E 1959 


Males (N = 9) 
Females (NV = 10) 


C 1959" 
Males (N = 10) 50.1 
Females (N = 10) 51.3 


Note.—Males: Mother Test 1-Test 2 
Females: Mother Test 1-Test 2 


t = 1.473 
= 4.997*** 


* All 20 cases that were given Test 1 are reported here to maximize the NV. 


*** Significant at .01 level. 
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to the important others but rather a con- 
solidation of the definitions previously made. 
In addition, there was no accompanying 
change in the adjustment level of the self 
for the control group. In the successful 
therapy group the new self-consistent items 
showed significantly more shift in importance 
in the various relationships so that it could 
be stated that redefinition was taking place. 
Moreover, the overall adjustment level of 
the self-sort improved significantly. 

Looking at the particular interactional 
referents, it was found that clients were more 
poorly adjusted in their self-to-mother image 
than controls, and this relationship held for 
both studies and for the experimental group 
when stratified both by outcome of therapy 
(success and failure), and by sex (males and 
females). This appears to be a reliable dis- 
tinguishing characteristic of subjects who 
apply for psychotherapy (at least of this 
kind), and who stick with it beyond six 
interviews. 

Control cases in the present study were 
significantly poorer in adjustment of their 
self-image in relation to their fathers than to 
their mothers, and this bordered on being a 
significant difference for the original sample 
of controls as well. It seems likely, then, that 
it may be “normal” to have a more poorly 
adjusted image of oneself in relation to father 
than to mother, but providing the self-to- 
mother image is sufficiently well adjusted, 
there is no internal pressure to seek psycho- 
therapy. This finding held equally for both 
sexes of control subjects in the present study. 
Unfortunately, the N in the previous study 
was too small to test it for that group. 

In the present study, males were found to 
enter therapy with significantly poorer ad- 
justment of self-in-relationship to father than 
did females. Successful cases improved the 
adjustment of their self-sort in relation to 
their mother, but there was no improvement 
of adjustment of the self-sort in relation to 
the father for any group in this study. This 
type of therapy, then, seems to be effective 
in resolving mother problems, but does not 
appear to change the self-to-father image. 


Perhaps clients with mother problems self- 
select client centered therapy as meeting their 
needs of working out their problems of ad- 
justment to a maternal figure. The therapist 
in this type of therapy is supposed to be 
warm, accepting, empathic, understanding, 
nondirective and to stress the feeling aspects 
of the communications: to play, essentially, 
a very feminine role. Or perhaps most 
psychoneurotic outpatients would be found 
to have disturbance in the mother area and 
some, namely, males, in the father area as 
well. But those who are in a more directive 
type of therapy might be found to have a 
reversed pattern of changes: improvement in 
the father area and little change in the 
mother area. These questions remain open for 
further research evidence. 
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SEXUAL SYMBOLIC RESPONSE IN PREPUBESCENT 
AND PUBESCENT CHILDREN 


AUSTIN JONES? 


University of Pittsburgh 


The Freudian hypothesis that pointed, 
elongated objects are symbolic of the penis 
and that rounded or enclosing objects are 
symbolic of the vagina has received inferential 
experimental support in a recent study (Jones, 
1956). Adult subjects were asked, essentially, 
to ascribe masculine or feminine personality 
to each of a series of simple geometric figures. 
The figures were of two classes: pointed or 
elongated, and rounded or enclosing. Subjects 
responded in a manner consistent with the 
Freudian hypothesis significantly more often 
(p < .001) than would be expected by chance 
alone. Male subjects were found to respond 
significantly more consistently with the hy- 
pothesis than females. In discussing the latter 
finding, it was noted that available evidence 
(Kinsey, Pomeroy, Martin, & Gebhard, 1953) 
suggests that the sexual response of men is 
more readily conditioned to a wide variety of 
visual stimuli than is that of women. The 
symbols employed in the study were modifica- 
tions of those used earlier by Levy (1954), 
who failed to find support for the Freudian 
hypothesis in an experiment involving the 
matching of symbols and male and female 
given names, and the paired associate learning 
of like-sex and unlike-sex pairs of symbols and 
names. The subjects of that study were fifth 
grade children, most of whom were presum- 
ably prepubescent. In light of the subsequent 
positive findings with adult subjects, it ap- 
pears possible that age differences in the 
sexual symbolic response might account partly 
for the negative results of the earlier study. 
Developmental studies of sexual symbolism 
of an experimental nature are apparently 
totally lacking. 

1The author wishes to thank William Bendig, 


Ruth Goodman, and Melvin Manis for helpful com- 
ments and advice. 


The purpose of the present study was to 
assess the strength of the sexual symbolic 
response in children of various ages up to 
the time, roughly, of pubescence. The attitude 
of the study was exploratory; it is possible to 
formulate rather contradictory hypotheses as 
to the function relating age and strength of 
response. One view would hold that, symbolic 
behavior being generally a matter of socially 
acquired learning, the strength of the sexual 
symbolic response ought to increase with in- 
creasing age and increasing exposure to the 
socially determined symbols. Also, symbolic 
behavior generally may be conceived of as 
evidence of socially imposed inhibitions of 
more direct expression; the young child, 
having lived fewer years under the inhibiting 
influence of civilization, would be expected to 
show less use of symbolism and relatively 
greater directness of sexual expression than 
would the older child or adult. A different 
hypothesis emerges when we consider the data 
regarding drive level and generalization phe- 
nomena. As drive increases, the height of 
the generalization gradient is raised and dis- 
crimination correspondingly reduced. Beach 
(1942), for example, increased the sex drive 
of a group of male rats by injections of 
testosterone, with the result that the animals 
made sexual responses to a markedly increased 
range of stimuli—thus failing to discriminate 
what function “normally” as sexual and non- 
sexual stimuli. Similarly, we would expect 
that children approaching pubescence, i.e., 
experiencing an increase in sex drive, would 
demonstrate a decrement in discrimination 
between male and female sexual cues. Their 
responses to symbols would be less consistent 
with the Freudian formulation than those of 
children somewhat younger. The following 
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experiment was designed to clarify the issues 
involved in these contrasting hypotheses. 


METHOD 
Subjects 


Since the intent of the experiment was to study the 
sexual symbolic response as a function of degree of 
sexual maturation, a strictly chronological arrange- 
ment of subject groups which includes both sexes is 
less meaningful than a categorization which takes 
into account the different maturation schedules of 
males and females. Due to the onset of puberty ap- 
proximately 2 years earlier in females than in males, 
roughly equivalent levels of sexual maturation are 
obtained by grouping the males of a given age with 
females 2 years younger. Four such “sexual matura- 
tion” groups were constituted using boys and girls 
in a public school system. The “youngest” sexual 
maturation group consisted of second grade girls and 
fourth grade boys; the next two groups, fourth grade 
girls plus sixth grade boys, and sixth grade girls plus 
eighth grade boys. The final group was planned to be 
eighth grade girls and tenth grade boys; but adminis- 
trative permission could not be obtained for the 
utilization of high school (and hence tenth grade) 
students. Ninth grade boys were available, however, 
so that the final sexual maturation group was ap- 
proximated by pairing eighth grade girls and ninth 
grade boys. Ten boys and 10 girls were selected 
randomly at each of the four levels, making a total 
of 80 subjects. The mean ages of the five grades 
from which subjects were drawn (grades two, four, 
six, eight, nine) were 8.5, 10.5, 12.3, 14.4, and 15.5, 
respectively. Thus the subjects of the first sexual 
maturation level are approximately four years pre- 
pubescent, while at the fourth level most subjects 
of both sexes have attained puberty. 


Materials 


The stimuli in this experiment were the 10 sexual 
symbolic figures adapted by Jones (1956) from those 
used initially by Levy (1954). Five are pointed or 
elongated (“male”) and five are rounded or en- 
closing (“female”). The figures are essentially 
“abstract” or without specific object identity ; they are 
variations of circles, pyramids, rods, cubes, etc. The 
figures are reproduced in black ink on individual 
white cards. In the prior study supporting the 
Freudian view with adult subjects, one of the pre- 
sumed female symbols was shown not to be so. The 
figure was included in the present administration in 
order to make the procedures of the two studies 
comparable for clarity of discussion, although re- 
sponses to the nonfunctional figure did not enter into 
the score variable. 


Procedure 


The subjects were seen individually by the author. 
The experimental procedure was second in a series 
of other experimental procedures, allowing time for 
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the establishment of optimal rapport. Subjects were 
presented the following instructions orally. They are 
the instructions employed in the prior study with 
adult subjects except for editing to lower the 
vocabulary level. 


You know, lots of times things kind of seem like 
people to us—kind of remind us of people some- 
how. Many things around the house, for instance, 
or almost anything you know real well. Some 
things seem more like men and some things seem 
more like women. I have a set of cards here which 
have designs or pictures on them. I’m going to 
show them to you one at a time—and you tell 
me which they remind you of more—men or 
women. Just use your imagination and tell me 
the first answer you think of. Don’t stop to think 
about it; just tell me the first thing you think of. 


Following the instructions, the experimenter pre- 
sented the 10 figures in an order randomized for 
each subject. The subjects were required to respond 
within approximately 2 seconds. 


RESULTS 


The data of the experiment are presented 
graphically in Figure 1. The percentage of 
responses plotted there is the pecentage (out 
of five male symbols or out of four female 
symbols) which were consistent with the 
Freudian hypothesis. The data are plotted 
separately for male and female subjects and 
for their response to the two types of symbols, 
male and female. Data for the normal adults 
of the prior study (Jones, 1956) are plotted 
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Fic. 1. Mean percentage of responses consistent 
with the Freudian hypothesis as a function of 
maturation level. (Indicated by the grade scale ad- 
justed so as to equate maturation level for the two 
sexes. The two points plotted at the extreme right 
represent the means for the college subjects of the 
prior study.) 
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for comparison. Table 1 summarizes an analy- 
sis of variance of trends (Grant, 1956) over 
the four sexual maturation levels. The arc- 
sine transformation for proportions was em- 
ployed (Walker & Lev, 1953). Differences 
between maturation levels (linear component ) 
attained the .06 level of significance, and dif- 
ferences between symbol types the .01 level. 
Differences between subject sex groups failed 
to reach significance, as was true of the 
various interactions, 

An additional analysis of variance was per- 
formed which compared the pooled child data 
of the present study with the adult data of 
the prior one. This analysis, summarized in 
Table 2, was thus based upon two age classes 
and, as before, two sex classes and two symbol 
type classes. Scores again were the arc-sine 
transformations of the proportions of “cor- 
rect” responses. Significant differences were 
demonstrated between the two age groups and 
between symbol types (p < .01 in each case). 
The Age X Sex interaction approached con- 
ventional significance levels (p < .08). All 
other interactions were nonsignificant. 


TABLE 1 


ANALYSIS OF VARIANCE OF PROPORTIONS OF ‘‘CORRECT”’ 
RESPONSES FOR EACH Type 


Source 
Between maturation levels 
linear 
quadratic 
cubic 


7.704 
13,489 
1,967 


7,657 
3,000 


Sex X maturation level 59 


3 

1 

1 

1 

Between sexes 1 
3 

72 


Bet ween subjects within groups 


Between symbol types 7.80*** 


1 
Symbol type X maturation level 3 
Sy mbol type X sex 1 


Symbol type X maturation 
level X sex 
linear 
quadratic 
cubic 


Pooled symbol type X subjects 
within groups 


Note.—Data in the form 
proportions, 
< .06. 
< 


TABLE 2 
ANALYSIS OF VARIANCE COMPARING THE CHILD DATA 


OF THE PRESENT STUDY WITH THE ADULT DaTA oF 
THE Previous Stupy 


Source MS I 


Between ages 72,904 21.44°** 
Between sexes 320 
Age X sex 10,682 


3,400 


ate” 

Between subjects within 
groups 

Between symbol types 6,083 

3,27 


1,241 


4.90*** 

Symbol type X groups 

Pooled symbol tyne X subjects 
within groups 


Note.—Data in the form of arc-sine transformation of | 


portions of “‘correct"’ responses for each symbol type. 
*p > 05. 
< 


DISCUSSION 


The finding of principle interest has to do 
with the differences between maturation levels, 
the linear component of which was significant 
at the .06 level. An appropriate procedure, 
when findings fall very slightly short of the 
accepted .05 level, would be to replicate the 
experiment within the same population so as 
to provide a more definite evaluation when it 
is reported. Such a replication would have 
been desirable here but was not possible for 
administrative and public relations reasons. 
Consequently, the finding will be discussed 
tentatively here as significant, with the obvi- 
ous comment that additional assessments of 
the relationship are needed. 

The overall downward trend in sexual 
symbolic response over the four maturation 
levels of the present study supports the hy- 
pothesis that increased sexual drive as puberty 
approaches is accompanied by a raising of 
generalization gradients and an attendant 
decrement in discrimination between symbols 
of “maleness” and “femaleness.” The four 
maturation levels extend, roughly, from 4 
years prior to average age of pubescence, to 
1 or 2 years beyond it. Sexual symbolic re- 
sponse during that period decreased from 
66°% to 52% (approximately chance expect- 
ancy). This trend was a highly consistent 
one, occurring in the responses of both male 
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and female subjects to both male and female 
symbols. Although it is not possible to reject 
entirely the hypothesis that socially acquired 
learnings tend gradually to instill the sexual 
symbolic response, it appears that their effect, 
if any, is obscured in the years approaching 
puberty by the relationship between increased 
drive and lowered discrimination. (The curi- 
ous upward trend in the data of the third 
maturation level [approximately age 12 for 
girls, 14 for boys; see Figure 1] fails to reach 
conventional significance levels as tested by 
the cubic component of variance associated 
with maturation levels.) 

The finding of significant differences be- 
tween symbol types, with male symbols being 
responded to “correctly” more often than 
female symbols, confirms a trend in the data 
of the previous study (Jones, 1956). Whether 
or not this is a phenomenon of some generality 
remains unclear, since no attempt was made 
to equate the particular male and female 
symbols used for the degree of stimulus 
generalization which they represented. It is 
entirely conceivable that another set of al- 
leged sex symbols might lead to a contrary 
finding. What is suggested, possibly, is that 
it is easier to draw a male sex symbol than 
a female sex symbol. 

Comparison of the child data of the present 
study with the adult data of the previous one 
(Table 2) showed a significant increase in 
sexual symbolic response at young adult years 
over the highest frequency obtained in any 
of the prepubescent years. The frequency of 
response was 82% for the adult group (mostly 
in their early twenties) as compared with a 
high of 66% for children approximately 4 
years prepubescent. Although sex drive is 
generally believed to decrease slightly between 
adolescence and the early twenties, it does 
not seem plausible to regard the associated 
lowering of generalization gradients as the sole 
cause of the striking increase in symbolic 
response. A more plausible but clearly con- 
jectural explanation would be that following 
puberty individuals undergo a period of 
socially enforced discrimination training with 
respect to sexual cues and symbols, a degree 
of discrimination training not present in the 
prepubescent years. Thus, the overall develop- 
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mental history of the symbolic response, it is 
suggested, involves two phases with, most 
probably, different determinants. For at least 
4 years prior to puberty, there is a continu- 
ous decrease in sexual symbolic response that 
is interpreted as a result of decreased dis- 
crimination attendant upon increased drive. 
By the early twenties, however, the frequency 
of sexual symbolic response has exceeded its 
prepubescent maximum, presumably as a 
function of the increased discrimination train- 
ing implicit in the highly focused social con- 
trol of adolescent heterosexual behavior. The 
verification of such an explanatory principle 
rests, of course, upon empirical research yet 
to be carried out. 


SUMMARY 


The purpose of the present study was to 
assess the strength of the sexual symbolic re- 
sponse in children of various ages through 
pubescence. Sexual symbolic response was de- 
fined as the designation of pointed, elongated 
figures as male and of rounded, enclosing 
figures as female. The figures employed were 
relatively abstract geometric forms taken from 
earlier studies of Jones and of Levy. 

The mean frequency of sexual symbolic 
response was 66% for children approximately 
4 years prepubescent. By the age of puberty 
or a year or 2 beyond, however, response had 
dropped sharply to about chance expectancy. 
The finding was interpreted as an effect of 
decreased discrimination associated with 
heightened sexual drive as puberty is ap- 
proached. The data were compared with the 
performance of adults on the same task re- 
ported in a prior study. By the age of about 
22, the frequency of the sexual symbolic re- 
sponse (82%) was shown to climb beyond 
the prepubescent maximum, perhaps as a 
function of the increased discrimination train- 
ing implicit in the highly focused social 
control of adolescent heterosexual behavior. 
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RELATIONSHIPS BETWEEN 1960 STANFORD-BINET, 
1937 STANFORD-BINET, WISC, RAVEN, 
AND DRAW-A-MAN 


BETSY WORTH ESTES, MARY ELLEN CURTIN, ROBERT A. DEBURGER, 
AND CHARLOTTE DENNY? 


University of Kentucky 


The latest revision of the Stanford-Binet 
(S-B) test of intelligence was published in 
January 1960; therefore, it was thought ad- 
visable to compare the IQs of a group of white 
American children on the 1960 test with IQs 
made by the same group on the 1937 S-B and 
on the Wechsler Intelligence Scale for Chil- 
dren (WISC). In addition, Raven Progressive 
Matrices (1938 & 1947) and Goodenough 
Draw-A-Man (D-A-M) IQs for part of the 
group are compared with the two S-Bs and 
the WISC. 

The group consists of pupils attending the 
University of Kentucky School, grades one 
through eight. Parents’ socioeconomic status 
is above average; fathers’ occupations are 
managerial and professional. Tested intelli- 
gence of the pupils is likewise above average; 
mean IQ on the 1960 S-B equals 123; the 
range is 84 to 159. The selection criterion was 
the availability of scores for the 1937 S-B 
L and M forms and WISC full scale. All pu- 
pils meeting this criterion were included in 
this study, making a total of 82 for the major 
comparisons, 47 boys and 35 girls. 

The 1960 S-B was administered by the au- 
thors. All other tests were administered over 
a 4-year period by graduate students in psy- 
chology enrolled in an intelligence testing 
course. All of the tests were checked, rescored, 
and supervised by the senior author. 


RESULTS 


There were from one to four scores avail- 
able for each test; therefore, means were used, 
when available, for greater reliability. The 


1 Denny is at the College of Nursing. 


1937 S-B score was a composite of the L and 
M scores. 

Comparisons are reported in the form of 
group means and product-moment correlation 
coefficients. 

The 1937 S-B scores were converted accord- 
ing to the equation provided in the 1960 S-B 
manual (Terman & Merrill, 1960, p. 339). 
This equation provides corrections for the 
mean and standard deviation which are not 
precisely equal for all chronological age groups 
for the 1937 S-B. The converted IQ scores 
were generally lower than the unconverted 
scores, mean deviations by intelligence level 
varying from two to four points. Overall, how- 
ever, differences for means, standard devia- 
tions, and correlations were small and gener- 
ally inconsistent and, therefore, comparisons 
involving the 1937 S-B are based on uncon- 
verted scores. 


WISC, 1937 S-B, 1960 S-B 


Age. Littell (1960), in a review of WISC 
studies, concluded that children who are 
younger and rank higher on the 1937 S-B 
tend to score higher on the 1937 S-B than on 
the WISC. Table 1 shows no differential dis- 
crepancies in the present study due to chrono- 
logical age. The null effect of age has also 
been found by Weider, Noller, and Schramm 


' (1951), Gehman and Matyas (1956), and 


Schacter and Apgar (1958); hence, it ap- 
pears that the evidence for the age factor is 
inconclusive. When the 1960 S-B is compared 
with the 1937 S-B and with the WISC, again 
age is not found to be a factor. 


388 


Fay 
1 3 
OF 
3 
el 
— 
: 
ve 
| 


Relationships among S-B, WISC, Raven, and Draw-a-Man 


TABLE 1 
1Q Scores 


Group 

Average 

High average 
Superior 

Very superior 
CA 6-10 to 10-0 
CA 10-0 to 14-1 
Entire group 


12 2 
19 —1 
34 2 
17 8> + 1.93 
33° 1 
34° 4 
82 2 


Note.—Intelligence levels based on 1937 S-B scores. 
* Discrepancy = 0, p < .00002. 
» Discrepancy = 0, p < .002. 


N 1960S-B 1937 minus 1960 1937 S-B 1937 minus WISC 


WISC 1960 minus WISC 
102 1 

114 3 

128 9* + 1.24 

146 16* + 1.89 

124 8 

126 9 

125 7 


—1 


¢ The N for the CA groups is less than for the entire group because some children changed from the lower to the higher CA 


classification during the 4-year testing period. 


Intelligence. Littell’s conclusion regarding 
the effect of IQ level on WISC-1937 S-B 
test discrepancies is supported in the pres- 
ent study. Two-tailed ¢ tests were made when 
the discrepancies were greater than the five 
points usually allowed as test-retest error. For 
the two superior groups, the superiority of 
the 1937 S-B over the WISC is significant, 
p < .00002. This finding applies likewise to 
the 1960 S-B superiority over the WISC, p 
< .002. For average groups, WISC scores are 
comparable to both the 1937 and 1960 S-B 
scores. Discrepancies between both S-B tests 
and the WISC are greater for the superior 
levels than for the average levels, p < .00002 
for the 1937 S-B and the WISC and p = .02 
for the 1960 S-B and the WISC. Error terms 
for the 1937 S-B and the 1960 S-B compari- 
sons are 1.61 and 2.13, respectively. 1937 and 
1960 S-B scores are comparable except at the 
very superior level where the discrepancy is 
again highly significant, p < .002. Cronbach 
(1960, p. 171) reports, “IQs on the two scales 
are not strictly comparable.” 

Entire group. The major correlations for the 
entire group are presented in Table 2. Inter- 
correlations between the 1960 S-B, the 1937 
S-B, and the WISC do not differ significantly 
from each other. Although the population is 
a select one and the size of the sample is rela- 
tively small, the agreement found between the 
1937 S-B and the WISC compares favorably 
with the correlations reported for white Ameri- 
can children by Littell (1960), ie., in the 
80s. The correlation between the 1937 S-B 
and the WISC tends to be higher than that 


between the 1960 S-B and the WISC. This 
might be expected for two reasons: (a)_ the 
test interval is greater for the 1960 S-B and 
WISC administrations and (}) the 1937 S-B 
scores are based on two to four tests and, 
hence, may be more reliable than the 1960 
S-B scores which are based on one test. The 
correlation between the 1960 and 1937 S-B 
scores reaches a respectable validity figure. 
The 1937 L and M correlation compares fa- 
vorably with that reported by Terman and 
Merrill (1937, p. 47) for the 1937 S-B stand- 
ardization group. 


Raven and Draw-A-Man 


As a trend, the Raven Progressive Matrices 
is superior to the D-A-M in predicting the 
1960 S-B, the 1937 S-B, and the WISC 
(Table 3). Differences between the Raven and 
D-A-M are not significant, however. While the 
Raven and D-A-M scores account for a sig- 
nificant amount of the variance of the three 
major tests, the magnitude of these correla- 
tions is relatively small, and, consequently, 


TABLE 2 


Major CORRELATIONS 


1937 S-B, 
Test Form L 
1937 S-B 
WISC 
1937 S-B, M 


1960 S-B WISC 


82 .80—.80’s 
7 


.92—.91 


Note.—N = 82; all > 0, p < .005; representative finding 
on right. 
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TABLE 3 


RAVEN AND Draw-A-MAN CorRRELATIONS 


Test 1960 S-B 1937 S-B WISC 
D-A-M 43 46—.41 AS 
Raven 59 .67—.54 55—.91 

75 

Note.—N = 72; all > 0, < .005; representative findings 


on right. 


individual predictability is low. The D-A-M 
and 1937 S-B correlation compares favorably 
with that found by McHugh (1945) for pub- 
lic school kindergarten children but is much 
lower than Goodenough’s (1926, p. 51) 
D-A-M and 1916 S-B correlation of .74. The 
Raven and 1937 S-B correlation is higher than 
that found by Banks and Sinha (1951) for 
London school children, whereas the Raven 
and WISC correlation is much lower than the 
corresponding correlations of .91 and .75 
found by Martin and Wiechers (1954) and 
Barratt (1956), respectively, for American 
school children. It is difficult to evaluate com- 
parisons involving the Raven and D-A-M tests 
since there are so few representative studies 
in the literature. 


DISCUSSION 


The primary purpose of this report is to 
compare the 1960 S-B with its predecessor, 
the 1937 S-B, and another major children’s 
intelligence test, the WISC. The 1960 S-B 
was found to correlate equally well with the 
1937 S-B and the WISC, these correlations 
being comparable to representative findings 
for similar relationships. While the 1960 S-B 
was found to predict the 1937 S-B quite well 
for a fairly heterogeneous group, prediction 
was not equal for subgroups classified accord- 
ing to 1937 S-B intelligence levels. Agreement 
was found for average and superior levels but 
not for the very superior level. This partial 
disagreement of the two scales supports Cron- 
bach’s (1960) doubt that they are strictly 
comparable but further investigation with re- 
gard to intelligence level is indicated due to 
the relatively small size of the samples re- 
ported here. 

Intelligence level was likewise found to be a 
significant factor preventing high agreement 
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between both S-B tests and the WISC, a fact 
well supported by comparisons of the 1937 
S-B and the WISC. Implications from find- 
ings of the present study are that at average 
levels of intelligence WISC scores may be 
used interchangeably with scores from both 
S-B tests. This is not true for the superior 
levels where the obtained significant discrep- 
ancies of 7 to 16 points may easily place the 
scores in different intelligence classifications 
resulting in somewhat spurious or doubtful 
interpretations. However, it would be unwise 
to set definite limits for test comparability 
according to intelligence level on the basis 
of existing information which is not in com- 
plete agreement or strictly comparable. More 
information is needed drawn from larger and 
more representative samples. This informa- 
tion is no doubt available if existing test data 
were Classified and analyzed. 


SUMMARY 


1. The comparability of IQs from five dif- 
ferent intelligence tests was investigated for 
an above average group of white American 
children. 

2. For the entire group (NV = 82), scores 
for the 1960 S-B, the 1937 S-B, and the WISC 
were found to be comparable and to compare 
favorably with representative similar findings. 

3. The age factor, contrary to some previ- 
ous findings, was not found to account for test 
discrepancies among the two S-B and WISC 
instruments. 

4. Intelligence level, in agreement with 
previous findings, was a factor producing 
highly significant discrepancies at superior 
levels among the two S-B and WISC instru- 
ments. More investigation is needed regarding 
the effect of intelligence level on test com- 
parability. 

5. Correlations relating the Raven and 
D-A-M to the two S-Bs and WISC were sig- 
nificant but relatively small. 
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The meaning of the concept of “ego iden- 
tity” is still evolving. In Erickson’s perceptive 
essays on adolescent adjustment and behavior 
(1950, 1956), the construct is discussed and 
explored so richly that an easy and clearly 
sufficient operational translation of the no- 
tion cannot be formulated. A core meaning of 
the concept, however, is perhaps to be found 
in Erickson’s (1950) statement that “the 
sense of ego identity is the (individual’s) 
accrued confidence that (his) inner sameness 
and continuity are matched by the sameness 
and continuity of (his) meaning for others 
228). 

In this definition, three elements are pres- 
ent. First, an individual must perceive himself 
as having “inner sameness and continuity,” 
i.e., he must, over time, presume himself to be 
essentially the same person he has been. Sec- 
ond, the surrounding persons in one’s social 
milieu must perceive a “sameness and conti- 
nuity” in the individual also. And finally, the 
individual must have “accrued confidence” in 
a correspondence between the two lines of 
continuity, internal and external. His percep- 
tion of the person he sees himself as being 
must be validated by feedback from his inter- 
personal experiences. 

From this definition, one aspect of ego 
identity may be singled out for study, the 
dimension we have labeled role variability 
(RV). The meaning of role variability is per- 
haps most readily indicated by describing its 
extremes. At one end of this dimension, there 
is “role diffusion,” where an individual is an 
interpersonal chameleon, with no inner core 
of identity, fitfully reacting in all ways to all 
people. This kind of person is highly variable 
in his behaviors and is plagued by self-doubts 


This investigation was supported by Research 
Grant M-1078 from the National Institute of Mental 
Health of the United States Public Health Service. 


EGO IDENTITY, ROLE VARIABILITY, AND ADJUSTMENT* 


JACK BLOCK 


University of California, Berkeley 


392 


and despairs for he has no internal reference 
which can affirm his continuity and self-in- 
tegrity. At the other extreme, there is what 
might be called “role rigidity,” where an in- 
dividual behaves uniformly in all situations, 
disregarding the different responsibilities dif- 
ferent circumstances may impose. Here the 
core of identity is hollow, based not on a 
genuine and unquestioned sense of personal 
integrity but rather upon deep seated fear of 
any amount of self-abandon. Somewhere in 
between, presumably, a proper balance can be 
struck in the struggle both for identity and 
the capacity for intimacy. 

In terms of the preceding definition of ego 
identity, role variability may exist at the level 
of self-evaluation and separately, at the level 
of observations by others. It is the relation 
between these two levels of role variability 
that specifies ego identity or a problem of ego 
identity. In the present study, the focus has 
been limited to “sameness and continuity” as 
perceived and evaluated by the participant, 
and its significance for adjustment. 

Some years ago, the writer reported a study 
of the way in which an individual’s role be- 
haviors changed as a function of various in- 
teractional contexts (Block, 1952). A single 
subject was studied by having her system- 
atically describe her interactions with a set of 
“relevant others.” These descriptions were 
then factor analyzed—an instance of O-tech- 
nique (Cattell, 1946)—and it was observed 
that the factor dimensions appeared to order 
and to summarize, in a cogent way, the sev- 
eral kinds of roles this subject manifested. 

In this earlier study, the focal subject 
viewed herself quite differently, depending 
upon the person with whom she was con- 
fronted, and yet she still presented a sub- 
stantial core of interpersonal consistency. Al- 
though seeing herself as changing from rela- 
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tionship to relationship, a general factor of 
some consequence proved to underlie all her 
interactions. 

During the course of evaluation of this and 
other findings, the general question was raised, 
“How much . . . interpersonal consistency— 
i.e., interpersonal sameness—is socially appro- 
priate and, in terms of the individual’s in- 
ternal psychic economy, consonant with his 
need systems?” (Block, 1952, p. 285). In 
Erickson’s terms, role diffusion is a person- 
ally untenable situation for the individual. He 
is beset by too many despairs and self-contra- 
dictions as a consequence of his extreme be- 
havioral fluctuations. Role rigidity, where an 
individual is not affected in his behaviors as 
a function of the persons with whom he is in- 
teracting, may be an indefinitely prolonged 
adaptation of sorts, but certainly is not an 
optimal interpersonal solution. It prohibits 
further growth and development of the indi- 
vidual; it is effective as a security mechanism 
only so long as the interpersonal surround is 
a tolerant one. From this frame of reference 
follows the simple hypothesis the present 
study endeavored to test: “the amount of 
interpersonal consistency is curvilinearly re- 
lated to the degree of maladjustment, as de- 
fined independently” (Block, 1952, p. 285). 


METHOD 
Index of Role Variability 


A set of 20 adjectives selected a priori as reflecting 
various fundamental facets of interpersonal behavior 
served as the basic descriptive device. Each subject 
ranked this set of 20 adjectives eight times, so as to 
characterize his own behavior while with each of 
eight specified “relevant others.” The 20 adjectives 
employed were listed in the following order: relaxed, 
formal, indifferent, warm, independent, witty, co- 
operative, assertive, indecisive, distractible, humorous, 
insincere, masculine (or feminine if rater was a 
woman), wise, unselfish, trusting, worrying, sug- 
gestible, responsive, and protective. The subjects 
were asked to describe via this ranking of adjectives 
technique their behavior and relationships with the 
following eight individuals: someone in whom you 
are sexually interested, an acquaintance you don’t 
care much about, your employer or someone with 
equivalent status, a child, a parent (or parent figure) 
of the same sex, a parent (or parent figure) of the 
other sex, a close friend of the same sex, an ac- 
quaintance whom you would like to know better. 
These eight specified other persons were selected as 
sampling widely, if not exhaustively, the interper- 
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sonal possibilities of the subjects employed. The test 
materials were arranged in booklet form, the de- 
scriptions being recorded on separate pages. 

For each subject, the eight adjective rankings were 
intercorrelated by the Spearman rank-difference cor- 
relation method and the resulting 8 ¥ 8 correlation 
matrix factor analyzed by the Thurstone centroid 
method. For each matrix, the percentage of the total 
communal variance explained by the first unrotated 
factor was calculated. The first unrotated factor of 
a matrix reflects the degree of congruence among the 
set of variables being studied. For the present data, 
the first unrotated factor indicates the extent of in- 
terpersonal consistency a subject views himself as 
manifesting over the set of relevant others specified. 
By dividing the mean first factor loading (squared) 
by the average communality of the matrix, an index 
of role variability is derived which is comparable 
from matrix to matrix, and hence from individual to 
individual. When high, this score suggests that the 
individual involved views himself as essentially the 
same person in his several interactions—he is inter- 
personally consistent; when low, the subject has de- 
scribed himself as rather different from situation to 
situation—he is interpersonally changeable.? This in- 
terpersonal consistency score has a potential range of 
0 to 100. 


Measurement of Maladjustment 


Each subject also responded to the 480-item Cali- 
fornia Psychological Inventory (CPI) (Gough, 1957 
The CPI, by virtue of the conceptual scheme which 
has guided it from its inception, employs a pool of 
items which are understandable by and inoffensive 
to nonpsychiatric populations and it was for this 
reason this inventory was used, even though the es- 
tablished scales of the CPI provide no direct meas- 
ure of maladjustment. Although a knowledgeable in- 
terpreter can readily discern maladjustment from the 
nature of a subject’s CPI profile, for our present 
purposes a more direct CPI measure of maladjust- 
ment was desired. 

As part of the continuing program of scale de- 
velopment at the Institute of Personality Assessment 
and Research, it proved possible several years ago to 
establish, refine, and validate a personality scale to 
index an individual’s “susceptibility to anxiety.” This 
scale, labeled Psychoneuroticism (Pn), was devel- 
oped by the sequential application of cluster analysis, 
item analysis against cluster score criteria, a further 
method of dimensional purification, and finally, vali- 
dation on a number of different subject samples. An- 
other paper will describe in some detail the develop- 
ment of the Pn scale and several other new scales 

2 Often, extracting the first factor from each sub- 
ject’s interperson matrix will prove to be too endless 
a job of calculation for the individual researcher. In 
such instances, Kendall’s coefficient of concordance, 
computed for each subject, may be substituted as a 
quick and largely adequate index of interpersonal 
consistency. 
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with unusual properties but for the present report a 
brief description of the Pn scale is in order. 

The Pn scale contains a total of 45 items* of 
which 33 are contained in the MMPI item pool and 
19 in the CPI collection of items (the apparent in- 
consistency in arithmetic here is due to the inclusion 
in the CPI of a number of MMPI items). The 19 
Pn items scorable from a CPI protocol are sufficient 
in number to provide a reliable and dimensionally 
valid score. 

The psychological meaning of the Pn scale is best 
conveyed for the purposes of this paper by a listing 
of its relationships to other established scales. It cor- 
relates in the .70s and sometimes .80s with the MMPI 
Psychasthenia scale, the MMPI Manifest Anxiety 
scale compiled by Taylor (1953), and the Anxiety 
scale developed from a factor analysis by Welsh 
(1956). All of these scales are good representatives 
of the first underlying dimension repeatedly found in 
factor and cluster analyses of personality inventories 
(cf., e.g., Block & Bailey, 1955b; Cook & Wherry, 
1950; Cottle, 1950; Kassebaum, Couch, & Slater, 
1959; Tyler, 1951; Wheeler, Little, & Lehner, 1951). 
This dimension, variously measured, has proved of 
broad significance in both correlational and experi- 
mental studies (cf., eg. Block & Bailey, 1955a; 
Eriksen, 1954; Farber & Spence, 1953; Taylor, 1956) 
and it is clear now that individuals at the unfavor- 
able end of the continuum tend to be troubled, self- 
preoccupied, and vulnerable to happenings that for 
most would go unnoticed. The choice, in particular, 
of the Pn scale to index this dimension of suscepti- 
bility to anxiety was in large part dictated by the 
option it provided of scoring individuals from CPI 
protocols. This choice was buttressed in addition by 
the scale’s factorial origins and its development on 
nonpathological samples. 


Subjects 


Forty-one college students in a class on factor 
analysis collected the interpersonal and CPI data 
(and factored their respective small matrices). Ano- 
nymity was preserved for the participants. Many of 
the students collecting the data later admitted having 
used themselves as subjects in order to test the psy- 
chological insightfulness of the later factor analytic 
results. For this reason, it seems likely that the re- 
sponses of the subjects to both the interpersonal 
consistency and CPI procedures were offered with 
appropriate motivation. 


3A complete list of the CPI and MMPI items 
defining the Pn scale has been deposited with the 
American Documentation Institute. Order Document 
No. 6828 from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress; Wash- 
ington 25, D. C., remitting in advance $1.25 for 
microfilm or $1.25 for photocopies. Make checks 
payable to Chief, Photoduplication Service, Library 
of Congress. Copies of the scale are also available 
from the writer upon request. 
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RESULTS 


For the measure of interpersonal consist- 
ency, the mean score was 71.42, the standard 
deviation being 13.73. The distribution of 
scores was somewhat negatively skewed. The 
mean score on the Pn scale was 7.35, with a 
standard deviation of 3.02. Here the score 
distribution was moderately skewed positively. 

The hypothesis originally advanced asserts 
that a significant curvilinear relationship, as 
measured by eta, should exist between inter- 
personal consistency and maladjustment. A 
scatter plot, however, shows quite clearly that 
a linearity assumption fits the data well. The 
product-moment correlation between the index 
of interpersonal consistency and scores on the 
Pn scale is —.52, significant beyond the .001 
level. Individuals who tend to see themselves 
as varying from interaction to interaction are 
also more maladjusted, as measured by the Pn 
scale. The expectation that individuals with 
too little role variability would also prove to 
have weaknesses in their personality makeup 
was not confirmed. 

In order to understand more closely the sig- 
nificance of the interpersonal consistency in- 
dex, an analysis was undertaken of the 480 
items in the CPI. The 20 individuals with the 
highest interpersonal consistency score were 
constituted as one group and the 20 indi- 
viduals with the lowest interpersonal consist- 
ency scores were formed into a second group. 
For each of the CPI items, the relative fre- 
quencies of response of the two groups were 
evaluated by means of Fisher’s exact test 
for 2 X 2 contingency tables. The number of 
items discriminating beyond the .05 level of 
significance was in excess of the number to be 
expected on the basis of chance by a factor of 
three, clear evidence for their nonchance na- 
ture (Block, 1960). The content of these dis- 
criminating items can thus serve to enrich our 
understanding of the psychological meaning 
of the index of interpersonal consistency. To 
bring some order into the set of distinguish- 
ing items, they are presented as grouped into 
four tentative and somewhat overlapping cate- 
gories. Except where noted, for all the items 
listed, the individuals who are changeable in 
their interpersonal role tend to say “Yes” 
more often. Significance level and frequencies 
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of the Yes response in the interpersonally 
changeable and interpersonally consistent 


groups are indicated in parentheses. 


Items expressing social inarticulateness and concern 


83. I usually feel nervous and ill at ease at a for- 
mal dance or party (.01, 9-1). 

134. It makes me uncomfortable to put on a stunt 
at a party even when others are doing the same sort 
of thing (01, 9-0). 

173. My way of doing things is apt to be misun- 
derstood by others (.05, 8-1). 

198. Before I do something I try to consider how 
my friends will react to it (.05, 14-6). 

200. (Affirmed more by interpersonally consistent 
individuals) In a group of people I would not be 
embarrassed to be called upon to start a discussion 
or give an opinion about something I know well 
(.05, 11-18). 

260. (Affirmed more by interpersonally consistent 
individuals) I always try to do at least a little better 
than what is expected of me (.05, 9-16). 

285. I refuse to play some games because I am 
not good at them (.01, 12-3). 

373. My table manners are not quite as good at 
home as when I am out in company (.01, 20-13). 


Items expressing personal tension and neurotic char- 
acter 

358. I dream frequently about things that are best 
kept to myself (.05, 5-0). 

406. I have one or more bad habits which are so 
strong that it is no use fighting against them (05, 
5-0). 

425. I have often felt guilty because I have pre- 
tended to feel more sorry about something than I 
really was (.05, 7-1). 

453. I work under a great deal of tension (.05, 
7-1). 


Items expressing cynicism based upon disappointment 

48. Most people would tell a lie if they could gain 
by it (.01, 15-5). 

219. Most people inwardly dislike putting them- 
selves out to help other people (.05, 8-1). 

225. People pretend to care more about one an- 
other than they really do (.05, 12-4). 

342. Some people exaggerate their troubles in or- 
der to get sympathy (.05, 19-13) 

446. I must admit that people sometimes disap- 
point me (.05, 20-14). 


Items expressing familial tension 

164. My parents have often disapproved of my 
friends (.01, 11-0). 

268. At times, I have been very anxious to get 
away from my family (.05, 19-12). 

271. My parents were always very strict and stern 
with me (.01, 8-0). 


An item not admitting categorization 


333. Education is more important than most peo- 
ple think (.05, 19-13). 
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Summarizing the sense of these discriminat- 
ing statements, it seems clear that the indi- 
viduals seeing themselves as highly variable 
in their interactions experience strain and dis- 
may in their social endeavours, view the world 
as essentially unfriendly, and are personally 
troubled. 

DiIscussIoN 

The results obtained support but one slope 
of the hypothesized inverted U relationship. 
Although extreme role variability does appear 
to be related to an independent index of per- 
sonality maladjustment, extreme role stability, 
at least as represented in this study, is not 
also indicative of neurotic qualities. This par- 
tial support of the hypothesized relationship, 
moreover, is not a one time finding for an- 
other, albeit less adequate, test of the same 
hypothesis derived equivalent results. In a 
study of 50 Vassar alumnae some 20 years 
after graduation, where interpersonal incon- 
sistency was defined in almost exactly the 
way already indicated, role stability corre- 
lates again showed a consistent picture of 
(relative) psychological health. For example, 
women who are stable in their interpersonal 
role were described in ratings formulated com- 
pletely independently of scores on interper- 
sonal consistency as relatively “indulgent and 
forgiving, protective of those close to her, 
sympathetic, efficient, adequate in her sexual 
role, turned to for advice and reassurance, 
facially and gesturally expressive, and con- 
siderate.” Women who are variable in the in- 
terpersonal behavior were described as “irri- 
table and overreactive, talkative, ostentatious, 
and sarcastic.” Interpersonal consistency cor- 
related .29 with a consensus rating of degree 
of adjustment, a relationship significant at 
the .05 level. A finding by Meltzer (1957) 
to the effect that a large self—ideal-self dis- 
crepancy—a reasonable measure of self-recog- 
nized maladjustment—is significantly related 
to extreme role variability, defined by means 
equivalent to those used here, may also be 
interpreted as congruent with the present find- 
ings. It seems fair to conclude, then, that a 
positive relation exists between role variabil- 
ity as indexed here and the kind of malad- 
justment where the individual explicitly and 
consciously acknowledges his personal vulner- 
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ability. There is as yet no empirical sugges- 
tion that extreme interpersonal consistency is 
also related to maladjustment. 

When a hypothesis achieves partial but not 
complete confirmation, there is the possibility 
that either the hypothesis is wrong as stated 
or that it was not tested properly. Before 
abandoning a hypothesis held likely on other 
grounds, it is required that the measures em- 
ployed be evaluated for their adequacy and 
the sample in which the relationship was 
sought be evaluated with reference to its ap- 
propriateness for the hypothesis being tested. 
Let us consider these in turn. 

The use of a personality scale such as Pn 
to index the dimension variously called “sus- 
ceptibility to anxiety,” “neuroticism,” “ego 
weakness” and so on is, as referenced earlier, 
both well-established and well-supported. In 
a large number of studies, the correlates of 
scores reflecting this dimension have testified 
to its importance and meaning. We may pre- 
sume, therefore, that the maladjustment in- 
dex employed in the present study is both 
representative and, on the whole, effective.‘ 

The merits and properties of the interper- 
sonal consistency measure of course cannot be 
fully evaluated presently. In support of its 
construct validity, however, a number of argu- 
ments can be adduced. The operations appear 
to parallel the conceptual steps involved in 
deriving the notion of role variability. The 
task presented to subjects is, on the face of 
it, not especially threatening, hence removing 
much of the conscious motivation for offering 
“safe” and uninformative responses. The way 
in which the subject’s data are then processed 
so as to provide a score is sufficiently compli- 
cated and removed as to prevent a subject 
from readily controlling, when he ranks his 


*It is important to note that the Pm scale is, in 
this and a number of other studies, orthogonal to the 
Ego Control (EC) Scale, a scale developed to meas- 
ure tendency to constrict or to express impulse. The 
EC scale correlates an insignificant .15 with role con- 
sistency, reflecting a slight tendency for over control 
to go along with reduced role variation. Although 
certain scales reasonably equivalent to Pn emphasize 
expressive or overt reactions to anxiety and de-em- 
phasize suppressive and indirect reactions, the inter- 
relations of Pn, EC, and the role consistency measure 
in the present study suggest that the failure of Pn to 
be related to role rigidity cannot be ascribed to a de- 
ficient representation of covert maladjustment. 
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adjectives, the score he later obtains. Finally, 
the theoretically appropriate if as yet insuffi- 
cient correlations of the measure with inde- 
pendent variables in several studies suggest 
its proximity, at least, to the underlying con- 
cept of role variability. 

Perhaps the primary reason why the origi- 
nally formed hypothesis failed of complete 
confirmation in this and the other studies 
cited is that the samples involved, in all three 
cases, may have been too small and too homo- 
geneous to contain enough individuals who, in 
an absolute sense, were “role rigid.” This is a 
post hoc explanation, of course, but it may 
be that individuals who go on to college, in 
the natural course of their selective evolution, 
necessarily develop some amount of flexibility 
in their role behaviors. College students can- 
not be insensitive and unresourceful before 
the various role demands made upon them 
and therefore, in working with college sam- 
ples, very many of the individuals who are 
rigidly the same in their interpersonal en- 
deavors may already have been screened out. 

As another alternative to explore, it may 
be that extreme interpersonal consistency is 
an aspect of personality at a somewhat later 
age, when the personally desperate individual 
has found a self-definition that is acceptable 
to him. This last possibility is infirmed some- 
what by the inability to identify such indi- 
viduals in the older Vassar sample. What is 
required is another study, this time of a much 
larger and preferably less homogeneous sam- 
ple so that individuals who are extremely 
stable in their role behaviors may fairly be 
presumed to have been included. Although 
there has been confirmation and cross-valida- 
tion of the hypothesis that extreme role vari- 
ability will be associated with personality mal- 
adjustment, it is premature, we would suggest, 
to abandon the additional hypothesis that ex- 
treme interpersonal consistency is also asso- 
ciated with personality maladjustment. A fur- 
ther empirical effort is needed to discover 
whether there is a far side to the mountain. 
In the meanwhile, to the extent that role 
variability as measured here relates to role 
diffusion as conceived by Erickson, the pres- 
ent study offers support for the implications 
he has drawn between ego identity and be- 
havior. 
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SUMMARY 


From Erickson’s concept of ego identity, 
the dimension of role variability was ab- 
stracted. The hypothesis was advanced that 
excessive role variability (“diffusion”) and in- 
sufficient role variability (“rigidity”), be- 
cause they both reflect problems in ego iden- 
tity, would both be associated with maladjust- 
ment. A measure of the extent to which an 
individual perceives himself as varying in a 
variety of interpersonal situations was devel- 
oped. Role variability, so measured, proved 
to relate significantly to maladjustment, as 
measured by a CPI scale to measure “sus- 
ceptibility to anxiety.” Role rigidity did not 
relate to maladjustment. Supplementary find- 
ings were introduced and some possible rea- 
sons for the only partial confirmation of the 
curvilinear hypothesis were offered. 


REFERENCES 


Biock, J. The assessment of communication: Role 
variations as a function of interactional context. 
J. Pers., 1952, 21, 272-286. 

Bock, J. On the number of significant findings to 
be expected by chance. Psychometrika, 1960, 25, 
369-380. 

Biock, J., & Batey, D. E. A cluster analysis of 82 
inventory measures of personality, interest, and 
intellect. Berkeley, Calif.: Institute of Personality 
Assessment and Research, 1955. (a) 

Brock, J., & Bamry, D. E. Q-sort item analyses of 
a number of MMPI scales. Technical Memorandum 
OERL TM-55-7, May 1955, Maxwell Air Force 
Base, Alabama. (b) 

Cattett, R. B. The description and measurement of 
personality. New York: World Book, 1946. 


397 


Cook, E. B., & Wuerry, R. J. A factor analysis of 
MMPI and aptitude test data. J. appl. Psychol, 
1950, 34, 260-265. 

Cottie, W. C. A factorial study of the multiphasic, 
Strong, Kuder, and Bell inventories using a popu- 
lation of adult males. Psychometrika, 1950, 15, 
25-47. 

Erickson, E. H. Childhood and society. New York: 
Norton, 1950. 

Erickson, E. H. The problem of ego identity. J. 
Amer. Psychoanal. Ass., 1956, 4, 56-121. 

Ertxsen, C. W. Psychological defenses and “ego 
strength” in the recall of completed and incom- 
pleted tasks. J. abnorm. soc. Psychol., 1954, 49, 
45-50. 

Farper, I. E., & Spence, K. W. Complex learning 
and conditioning as a function of anxiety. J. exp. 
Psychol., 1953, 45, 120-125. 

Goucu, H. G. The California Psychological Inven- 
tory. Palo Alto, Calif.: Consulting Psychology, 
1957. 

Kassepaum, G. G., Coucn, A. S., & Starter, P. E. 
The factorial dimensions of the MMPI. J. consult. 
Psychol., 1959, 23, 226-235. 

Mettzer, M. L. Role variability as a function of the 
understanding of others. Unpublished doctoral dis- 
sertation, Catholic University of America, 1957. 

Taytor, Janet A. A personality scale of manifest 
anxiety. J. abnorm. soc. Psychol., 1953, 48, 285- 
290. 

Taytor, Janet A. Drive theory and manifest anx- 
iety. Psychol. Bull., 1956, 53, 303-320. 

Ty.er, F. T. A factorial analysis of fifteen MMPI 
scales. J. consult. Psychol., 1951, 15, 451-456. 

Wetsu, G. S. Factor dimensions A and R. In G. S. 
Welsh & W. G. Dahlstrom (Eds.), Basic readings 
on the MMPI in psychology and medicine. Min- 
nesota: Univer. Minnesota Press, 1956 

Wueeter, W. M., Lritre, K. B., & Lenner, G. F. 
The internal structure of the MMPI. J. consult. 
Psychol., 1951, 15, 134-141. 


(Received August 1, 1960) 


of 
| 
ra 
4 


Journal of Consulting Psychology 
1961, Vol. 25, No. 5, 398-404 


One facet of the anxiety state is the ex- 
perienced alteration of somatic functioning. 
Recent work by Mandler and his associates 
(Mandler & Kremen, 1958; Mandler, Mand- 
ler, & Uviller, 1958) has again called attention 
to this aspect of anxiety, termed by them “au- 
tonomic feedback.” An Autonomic Perception 
Questionnaire (APQ) was developed for the 
self-description of somatic symptoms charac- 
teristic of subjects’ anxiety experience. They 
have investigated whether and how the num- 
ber and/or intensity of such symptoms is re- 
lated to other measures of anxiety and to au- 
tonomic reactivity under stress as measured 
directly. In their first study, which compared 
selected high and low APQ scorers, high sub- 
jects more commonly reported somatic sensa- 
tions during intellectually stressful tasks, and 
generally showed greater autonomic reactivity 
in terms of polygraphic measurements (Mand- 
ler et al., 1958). While generally replicating 
these findings, a second study of unselected 
subjects found less distinct relationships be- 
tween APQ scores and either somatic report 
or autonomic reactivity under the same stress 
(Mandler & Kremen, 1958). Correlations be- 
tween the APQ and both the Taylor Manifest 
Anxiety scale and a newly developed Body 
Perception Scale were positive in both studies, 
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though considerably lower in the second. The 
present study is designed to replicate and ex- 
tend these findings, and generally to explore 


further the personality correlates of autonomic 
feedback.* 


METHOD 


The Nowlis Adjective Check List (ACL), Barron 
Ego Strength scale (Es scale), Taylor Manifest Anx- 
iety scale (MA scale), and the Autonomic Perception 


3 For consistency with the earlier work, Mandler’s 
terms will be used in this paper. However, the more 
inclusive and neutral term “somatic experience” 
seems to describe better the range of phenomena in- 
cluded in Mandler’s “autonomic perception” or “au- 
tonomic feedback.” Apparently, Mandler intended by 
these terms to describe sensitivity to the various 
bodily sensations which subjects might describe in 
emotional states, regardless of whether they reflect 
autonomic nervous system activity as such. Certainly, 
different orders of neurological and physiological con- 
trol are suggested by such diverse APQ items as: 


7. When you feel anxious, are you aware of in- 
creased muscle tension ? 


8. ... do you get a headache? 

16. ...do you feel as if blood rushes to your 
head ? 

19... . do you get a sinking or heavy feeling in 


your stomach? 


Just as “autonomic” suggests a too specific physio- 
logical mechanism, the words “perception” and “feed- 
back” connote too immediate and definite a relation- 
ship between the physiological event and the experi- 
enced symptom. Precisely how experienced symptoms 
are related to measurable physiological activity is the 
research question at the core of Mandler’s concern, 
though not an issue in this report. But, for clarity, 
it should be remembered that APQ assesses the va- 
riety and intensity of self-reported bodily experience, 
and that it defines a variable akin to what in clinical 
contexts is described as somatization or even hypo- 
chondriasis. Indeed, it would be appropriate and per- 
haps clearer to distinguish high and low APQ scorers 
as “somatizers” and “nonsomatizers.” 
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Questionnaire (APQ) were administered, in that or- 
der, to better than half of the entire student body of 
George Williams College in Chicago. Each of the four 
college classes was tested in a separate session. The 
study, it was explained, was intended to explore some 
relations among test measures of emotional states. 
Students were assured that individual scores would 
be confidential, and that only group results might be 
discussed with their college authorities. The present 
report is based on the scores of 176 subjects (139 
men and 37 women) for whom complete protocols 
were available. Subjects ranged in age from 16 to 40 
years. 

Mood Adjective Check List. The list consists of 
140 adjectives (including 10 duplicates) which the 
subject rates on a four-point scale as more or less 
characteristic of his mood state (Nowlis, 1953). Fac- 
tor analytic studies reported by Nowlis and Green 
(1957; Green & Nowlis, 1957) reveal eight factors: 
A, Concentration; B, Aggression; C, Pleasantness; 
D, Activation-Deactivation, a bipolar factor; E, Ego- 
tism; F, Social Affection; G, Depression; and H, 
Anxiety. The original instructions request that the 
subject reply in terms of his present mood. In this 
study, he was asked instead to judge the items in 
terms of what is generally characteristic of him, since 
our concern is less with the subject’s momentary mood 
than with his perception of more enduring modes of 
emotional behavior. As an additional rough estimate 
of emotional lability, which will not be treated in 
this report, the subject was also asked to circle all 
adjectives which described feelings he had during the 
preceding 24 hours. 

Barron Ego Strength scale. This scale was derived 
empirically from the MMPI in terms of items which 
distinguished patients who benefited from psycho- 
therapy from those who did not (Barron, 1953, 
1954). From correlations with other assessment meas- 
ures in additional patient and normal samples, Bar- 
ron interprets the test as a measure of ego strength, 
including such characteristics as health and physio- 
logical stability, strong reality sense, feelings of ade- 
quacy, vitality, lack of prejudice, emotional spon- 
taneity and outgoingness, and intelligence. 

Taylor Manifest Anxiety scale. This procedure is 
also a derivative of the MMPI, consisting of items 
originally selected by clinical psychologists as ex- 
emplifying the manifest symptoms of anxiety (Tay- 
lor, 1953). The Taylor and Barron scales were ad- 
ministered in a combined form. 

Autonomic Perception Questionnaire. The complete 
questionnaire described by Mandler, Mandler, and 
Uviller (1958) was administered and scored accord- 
ing to their procedure. The most relevant portion, 
for the present study, consisted of 21 graphic scale 
items which comprise their “Anxiety APQ” score. 
Each of these items starts with the dependent clause 
“When you feel anxious . . .” and then inquires into 
the frequency and/or intensity of symptoms in each 
of seven areas of bodily function. For example, the 
item “When you feel anxious, do your hands become 
cold?” is rated on a scale from “No change” to “Very 
cold.” An additional nine items describe symptoms 


TABLE 1 


Sex DiFFERENCE IN MEAN AvTONOMIC PERCEPTION 
QUESTIONNAIRE, TAYLOR MANIFEST ANXIETY SCALE, 
AND BARRON EGo STRENGTH SCALE 


Men Women 


Anxiety APQ 
Pleasure APQ 
MA scale 

Es scale 


81.00 
19.46 
17.65 
46.43 


related to pleasure. These are of the same form, 
though starting with the clause “When you feel 
happy .. .” The sum of these nine items provides a 
“Pleasure APQ,” which will be of lesser concern in 
this report. 

An additional group of nine anxiety items was de- 
veloped to extend coverage in the areas sampled, 
and to tap further somatic areas (eg., faintness, 
nausea, polyuria, diarrhea). Scores based on these 
new items correlated highly with the original, 21- 
item, Anxiety APQ—r= .736 (male subjects) and 
r= .739 (female subjects), both significant at <.001 
level. Moreover, the new and original APQ scores 
vary in precisely the same way with each of the 
other personality measures. However, for greater 
comparability with earlier work, the Anxiety APQ 
analyses reported in this paper are based only on the 
original 21 items. 


RESULTS AND DISCUSSION 
Sex Differences in Mean Test Scores 


There are distinct and significant differences 
in mean APQ, MA scale, and Es scale scores 
between men and women (Table 1). Women, 
in this population, are higher in manifest 
anxiety, lower in ego strength, and report 
more somatic experience. The female mean 
Pleasure APQ is also higher; though insignifi- 
cantly so. 

The magnitude of these differences is puz- 
zling, particularly since none of the original 
reports of these procedures describes similar 
sex differences in seemingly comparable popu- 
lations. Although women had slightly higher 
MA scale values in the original Iowa sample 
(Taylor, 1953), the difference was insignifi- 
cant. Mandler and Kremen (1958) found no 
difference in mean APQ between Harvard 
summer school men and women, though they 
did discover some differences in autonomic 
response measures. Similarly, Barron (1953) 
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Anxiety APQ 


Test M 


Anxiety APQ 
Pleasure APQ 
MA scale 


Note.—Male N = 139; female N = 37. 
*p <.01. 


mentions no sex difference in ego strength 
scores. 

In view of the decided sex differences in 
these measures, it is interesting that sexes do 
not differ at all in the ACL factor scores 
for anxiety and depression. Indeed, only one 
of the nine ACL scores (Social Affection, p 
< .05) is significantly higher for women than 
for men. There is some tendency for women 
to be higher in Activation, but only at the 
p < .10 level. 

There is no ready explanation for the siz- 
able sex differences. However, in view of them, 
it seems necessary to make further correlative 
analyses of tests within each sex group sepa- 
rately, lest the pattern of intertest correlation 
also differ between sexes. It should be noted, 
at this point, that the pattern of mean differ- 
ences—higher APQ and MA scale going with 
lower Es scale—is in the direction predictable 
from the pattern of intertest correlation. 


Relationship of Autonomic Feedback to Mani- 
fest Anxiety and Ego Strength 


Intercorrelations among APQ, MA scale, 
and Es scale for male and female subjects 
separately are given in Table 2. For both 
sexes, it is clear that those who report more 
Somatic experiences in emotional states are 
higher in manifest anxiety and lower in ego 
strength. The MA scale and Es scale, as 
might have been anticipated, show substantial 
negative correlations. The correlation pattern 
supports Mandler’s contention that autonomic 
perception is part of the anxiety complex. 
Subjects who are less capable of integrative 
functioning and who are simultaneously more 
prone to emotional disturbance experience a 
wider range and more intense somatic symp- 
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TABLE 2 


INTERCORRELATIONS AMONG AUTONOMIC PERCEPTION QUESTIONNAIRE, TAYLOR MANIFEST ANXIETY 
SCALE, AND BARRON EGO STRENGTH SCALE FOR MALE AND FEMALE SUBJECTS 


Pleasure APQ 


F 
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MA scale Es scale 


M F 
.509* .410* .634* — .258* — .423* 
.288 — .300* — 317 


toms. The APQ vs. MA scale correlations of 
this study are of about the same order as 
those reported by Mandler et al. (1958) in 
their first report, though somewhat higher 
than the coefficient of .267 between Anxiety 
APQ and MA scale reported in their second 
study (Mandler & Kremen, 1958). Although 
the correlations between procedures within 
each sex group are generally in the same 
range, it is noteworthy that those for women 
are in all cases higher. 


Relationship of Autonomic Feedback to Ad- 
jective Self-Description 


In order to compare autonomic feedback to 
subjects’ ACL self-description, the male and 
female distributions of Anxiety APQ scores 
were divided into near-equal thirds; thus 
forming high, median, and low APQ groups 
of men and women. These APQ groups are 
compared, first, in terms of the Nowlis-Green 
factor analytically defined variables (Table 3) 
and, second, in the distribution of their rat- 
ings on the 130 adjectives considered indi- 
vidually (Table 4). Three-way split of the 
APQ distribution was used to detect curvi- 
linearity, if present. Although the mean ACL 
scores of the median APQ group often fell 
close to, or beyond, one of the extreme groups, 
in none of the significant comparisons pre- 
sented in Tables 3 and 4 was the relationship 
clearly curvilinear. Hence, these findings may 
be taken as generally descriptive of the differ- 
ences in ACL variables between higher and 
lower APQ scorers, of a roughly linear sort 
and certainly characteristic of extreme APQ 
groups. 

The two factor variables which best dis- 
tinguish the APQ groups are Anxiety and De- 
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pression (Table 3). Among both men and 
women, high APQ subjects are significantly 
higher in their mean Anxiety and Depression 
scores than the median and low APQ scorers. 
Suggestive, though less impressive, differences 
are found in the sets of scores defining the 
two poles of the Activation factor. Both male 
and female high APQ subjects tend to de- 
scribe themselves in Deactive terms (though 
the F ratio is not significant for women). 
However, among women, high APQ scores 
also tend to be higher in Activation, while the 
men if anything trend in the opposite di- 
rection. 

More detailed examination of the indi- 
vidual adjective ratings amplifies these find- 
ings, and makes clearer the apparent sex dif- 
ference in Activation. In Table 4 are listed 
the adjectives in which the three APQ groups 
differ, on chi square analysis, at five levels of 
significance. It might be noted that one would 
expect by chance 26 significant comparisons 
at the .10 level or better in the 260 compari- 
sons (130 adjectives, 2 groups). In fact, there 
are 56, over twice as many. Of greater impor- 
tance, however, is the internal consistency 
and apparent sense that can be made from 
the pattern of adjective self-descriptions. 

High APQ male subjects, contrasted to 
those reporting fewer and less intense somatic 
symptoms, describe themselves as inadequate, 
inactive, and helpless people. The self-image 
of weakness and incompetence is conveyed by 
the positive endorsement of such items as 
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weak, helpless, hesitant, and the like; and by 
low ratings on items which suggest active 
mastery, such as independent, resourceful, 
and effective. Emotionally, the adjective pat- 
tern suggests depression and feelings of fu- 
tility—-ashamed, downhearted, clutched up, 
frustrated—rather than any acute emotional 
distress such as might be expressed in hos- 
tility or anxiety. They are defeated, rather 
than angry men. Overall, one has the impres- 
sion of ego-weak, inadequate, and dependent 
people whose symptoms are more those of 
“neurotic debility” than of acute emotional 
distress. 

While high APQ female subjects, compared 
to their low APQ sex mates, also convey a 
distinctly neurotic impression, the pattern of 
adjective self-description differs from the male 
and suggests a more complex syndrome. As 
with the men, there are signs of inadequacy 
and weakness, but along with this consider- 
ably more evidence of stronger and more 
labile emotions—belligerent, lonely, over- 
joyed, irritated, angry. These subjects seem 
to experience higher levels of excitement, with 
wider swings of mood and activity, and more 
capacity for energetic striving, though per- 
haps with no more assurance of success than 
the men. Compared to their male counter- 
parts, high APQ females seem less concerned 
with their inadequacy, while describing more 
hostile and generally emotional interaction. 
They are more active and aggressive, men 
more passive and self-doubting. In terms of 


TABLE 3 


COMPARISON OF Now.is ApjectiveE Cueck List Factor Scores ror Low, MEDIAN, AND HIGH 
AUTONOMIC PERCEPTION QUESTIONNAIRE SCORERS, MALE AND FEMALE Supjects SEPARATELY 


Male APQ groups 
Median High F 
Concentration (A) 8.0 
Aggression (B) 5. 5.0 
Pleasantness (C) 
Activation (D+) 
Deactivation (D—) 
Egotism (E) 

Social Affection (F) 
Depression (G) 
Anxiety (H) 


= 
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Note.—Male N = 139; female N = 37. 


Female APQ groups 


Median 


Low 
7.8 8.2 8.2 
3.8 44 6.1 
7.4 8.1 8.8 
ns 8.7 99 10.8 
<.05 4.2 4.2 5.8 
ns 3.5 3.0 43 
ns 9.8 99 10.4 
<.01 2.2 2.7 6.1 
<.005 3.1 3.2 5.2 


* Variances were not homogeneous; therefore, the Kruskal-Wallis H test was substituted for the F test. 
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4 13 ns 
6 2.08 ns 
8 
+) 2 295 <.10 
2 2.02 ns 
0 1.22 ns 
2 8.828 <.01 
5 5.52 <.01 


TABLE 4 


FEMALE GROUPS 
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ApyectiveE SELF-RATINGS WHICH SIGNIFICANTLY DiF 

FERENTIATE HiGH, MEDIAN, AND Low ANXIETY 
AUTONOMIC PERCEPTION QUESTIONNAIRE 
SCORERS WITHIN MALE AND WITHIN 


Adjectives* which differentiate 
high, mecian, and low 
among: 


Significance level Men Women 
<.001 jittery 
doubtful 
shocked 
<O1 hel pless insecure 
inactive full of pity 
thirsty sleepy 
— wideawake __ belligerent 
lonely 
<.02 ashamed downhearted 
washed out regretful 
grouchy overjoyed 
— independent — calm 
* — effective 
<.05 downhearted hel pless 
startled startled 
frustrated energetic 
hesitant skeptical 
slow irritated 
bored angry 
clutched-up — optimistic 
| weak 
— resourceful 
— bold 
— careful 
<.10 self-conscious self-conscious 


jittery 
insecure 
dissatisfied 
timid 
smug 

blue 

— serious 
— alert 

— satisfied 


ACL. 


sign appears before the adjective. 
both the male and female list are italicized. 


Note.—Male NV = 139; female N = 37. 

* Adjectives are recorded here exactly as they appear in the 
If there is an inverse relation between APQ level and 
adjective rating (i.e., Highs rated lower than lows), a minus 
Adjectives which appear in 


frustrated 
hesitant 
careless 
boastful 
subdued 
restrained 


the sex norms of our culture, one might specu- 
late that the core neurotic problem expressed 
by each high APQ group represents inability 
to meet its particular sex standards: the males 
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are too dependent and ineffectual to be men; 
the females are too hostile and energetic to be 
women. The two adjective patterns suggest 
the “castrated male” and “penis-envy female” 
syndromes of psychoanalysis. Less specula- 
tively, it can be concluded that ACL analyses 
fully support the earlier presented correlative 
analyses in showing individuals reporting 
greater autonomic feedback to be generally 
more neurotic, less well integrated, and more 
prone to emotional disturbance, while further 
suggesting personality characteristics at odds 
with effective sex role functioning. 


Somatic Symptom Choice 


It may be of some general interest to note 
the relative popularity of various somatic 
symptoms contained in the APQ, and to con- 
sider whether the sexes differ in their symp- 
tom choice. For rough and exploratory analy- 
sis, the 30 item means (original 21 plus our 
additional 9 items) were separately ranked 
for women and men, to compensate for the 
sex differences in mean scores already noted. 

In general, men and women are quite simi- 
lar in their relative orders of symptom choices. 
Below are listed the five items receiving the 
highest ranks and the five ranked lowest by 
the entire student group. In parentheses, the 
rank for men, then for women, is indicated. 
Those items added to the original APQ are 
marked with an asterisk. Recall that all items 
are preceded by the stem, “When you feel 
anxious. . . .” 


Most highly rated items: 
do you get a fluttering feeling in your 
stomach (“butterflies”)?* (1, 1) 
how often are you aware of bodily reac- 
tions? (3, 2) 
are you aware of increased muscle tension? 
(2, 6) 
do you ever feel weak or shaky?* (6, 3) 
are you aware of many bodily reactions? 
(4, 7) 
Least highly rated items: 
do you experience nausea?* (30, 30) 
do you feel as if you might faint?* (29, 27) 
do you have to defecate frequently (diar- 
rhea)?* (28, 28) 
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do you experience a slowing of the heart?* 
(27, 29) 
do you get a headache? (26, 23) 


Note that four of the five least common 
symptoms are in response to items introduced 
by us. Perhaps such symptoms occur only with 
higher degrees of anxiety than these essen- 
tially healthy young men and women have 
experienced. 

Seven of the items had identical ranks, or 
ranks differing only by one unit. In addition 
to the four included in the lists above, men 
and women were essentially alike on: 


does your mouth become dry? (10, 10) 

do you have to urinate frequently? (15, 16) 

are you bothered by your bodily reactions? 
(17.5, 18.5) 


By contrast, inspection of the seven items 
which most differed in the rankings for men 
and women suggests some sex differences: 


do your hands become cold? (25, 8.5) 

do you have difficulty talking? (19, 8.5) 

does the intensity of your heart beat in- 
crease? (11, 20) 

how often are you aware of change in your 
breathing? (12, 21) 


does your stomach get upset? (20, 13) 

do you perspire? (5, 11.5) 

do you get a lump in your throat or choked- 
up feeling? (9, 4) 


Though the overall picture is one of greater 
agreement than difference in the somatic ways 
in which men and women express anxiety, the 
differences that are found seem consistent with 
clinical psychiatric and psychosomatic experi- 
ence. Thus, women more commonly name cold 
hands, reminiscent of Raynaud’s disease which 
is more frequently found among women, and 
difficulty in talking and lump in the throat, 
suggestive of hysteria. Though the differences 
are somewhat less great, men favor cardiac 
and respiratory symptoms, perspiration, and 
muscle tension, which taken together suggest 
the normal physiological reactions to exercise 
and exertion. Cardiovascular diseases are more 
commonly found among men, at least as in- 
dicated by mortality statistics in middle age. 
An important and long standing question in 
psychosomatic research, to which these data 
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may contribute slightly, is whether there are 
individual and group, such as sex, differences 
in somatic response in the acute emotional 
state which might be predictive of “symptom 
choice” in the psychosomatic disease state. 


Relation between Anxiety and Pleasure APQ 


Thus far, attention has been centered on 
somatic experience in anxiety (the anxiety 
APQ score), and consideration of the signifi- 
cant positive correlation between Anxiety and 
Pleasure APQ noted in Table 2 has been by- 
passed. In their original study, Mandler, 
Mandler, and Uviller (1958) report correla- 
tions of .50 and .45, for two male samples, 
between the two autonomic feedback scores. 
Although little systematic attention is given 
to this relationship—and in their later report 
Mandler and Kremen (1958) do not treat 
Pleasure APQ at all—these findings suggest 
that somatic sensitivity may be an individual 
characteristic in all or many states of emo- 
tional arousal. Generally anxious and ego- 
weak people, as we have already seen, are 
more prone to such experience, but apparently 
when happy as well as when anxious. At the 
same time, all subjects report more somatic 
experience in anxiety than in pleasure. Com- 
parison of the duplicated items in the two 
scales shows consistently and significantly 
higher ratings for the Anxiety than for the 
Pleasure form. Apparently, then, while all 
subjects experience more somatic involvement 
in anxiety, individual differences in sensitiv- 
ity to such experience are consistent across 
both states. 


SUMMARY 


This paper reports a correlative study of 
autonomic feedback based on the Mandler 
Autonomic Perception Questionnaire, Barron 
Ego Strength scale, Taylor Manifest Anxiety 
scale, and Nowlis Adjective Check List scores 
of 139 male and 37 female college students. 
Women are significantly higher in autonomic 
feedback scores, lower in ego strength and 
higher in manifest anxiety, though the within- 
sex correlations were substantially the same 
for both sexes. In both groups, autonomic 
perception correlated positively with manifest 
anxiety and negatively with ego strength; the 
latter two correlating negatively between 
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themselves. High APQ scorers had signifi- 
cantly higher scores on the Nowlis-Green fac- 
tor variables of anxiety and depression. How- 
ever, within each sex group, proneness to au- 
tonomic feedback may characterize those who 
are least successful in their sex roles. In ad- 
jective self-descriptions, high APQ males re- 
veal themselves as ineffectual, dependent, and 
depressed, high APQ females as more active 
and aggressive with stronger and more labile 
emotions. Thus, it may be concluded that re- 
porting more numerous and intense somatic 
experiences in emotional states is generally 
characteristic of more neurotic, inadequate, 
and anxious individuals, though the particular 
neurotic problem may differ for each sex. 

Though more autonomic feedback is re- 
ported in anxiety than in happiness, the posi- 
tive correlation between autonomic feedback 
scores in the two states suggests that the 
tendency for somatic involvement, at least as 
assessed by the APQ, is a general character- 
istic of individuals in a variety of emotional 
states. 
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THE BENDER GESTALT: 
A CLINICAL STUDY OF CHILDREN’S RECORDS 


WENTWORTH QUAST 


University of Minnesota Medical Center 


The need for indicators of brain dysfunc- 
tion at all ages is apparent to clinical psy- 
chologists, but is particularly acute in the as- 
sessment of children where changes with age 
and wide variation of many growth charac- 
teristics within an age range complicate the 
problem. Incidence figures for brain injury in 
children in this country are highly variable, 
although approximately three million is an 
estimate made by Martha Elliott (1956), 
Chief of the Children’s Bureau. This figure 
does not include an additional number of chil- 
dren with behavior and learning disorders pre- 
sumed to result from organic pathology. 

One of the main problems confronting the 
child clinical psychologist is in the relative 
weights to be assigned to emotional, func- 
tional, dynamic, or learned components as 
contrasted with intrinsic, constitutional, or 
organic components in the unusual symp- 
tomatology presented by the child patient. 
One measure of presumably intrinsic defects 
is difficulty in coordination as evidenced in 
visual-motor tasks. For research and clinical 
evidence that organic patients do have visual- 
motor difficulties, the reader is referred to re- 
views by Klebanoff (1945) and Klebanoff, 
Singer, and Wilensky (1954). The present 
study was undertaken to test the validity of 
the Bender as an indicator of organic com- 
ponents using 100 child subjects, either in- 
patients or outpatients of the Division of 
Child Psychiatry, University of Minnesota 
Medical Center. 

The problem of establishing clear criteria 
for brain injury in children is complicated by 
the subjective nature of neurologic examina- 
tion, by problems of reliability and validity 
in EEG interpretation, and by the frequent 
lack of any demonstrable physical changes in 


a child with known central nervous system 
defect. A survey of the final diagnoses of 325 
consecutive inpatient admissions showed acute 
or chronic brain syndromes to constitute about 
25% of the population. Considering the lack 
of definitiveness in making such a diagnosis, 
this figure is probably low. In the search for 
criterion groups it was felt more meaningful 
to examine the original impressions of refer- 
ring physicians or agencies, the presenting 
complaints, or the differential diagnoses con- 
sidered at the time of admission to hospital 
for suspicions of brain injury. When these 
are considered and the problem becomes one 
of examining for possible central nervous sys- 
tem involvement, the proportion of suspected 
brain injured becomes more nearly 50% of 
the clinic population. If the problem of men- 
tal deficiency were to be included, the prob- 
lem of determining “familial” versus ‘‘or- 
ganic” etiology is also at least a 50-50 base 
rate problem. It should be clear that the terms 
“brain injury” and “suspected organic brain 
damage” are used in a categorical sense, to 
indicate a wide range of central nervous sys- 
tem defects. Also it should be clear, while in- 
accuracies on the Bender beyond a certain 
age may be related to cerebral defect, the ex- 
tent of such defect is not implied. The test is 
examined as a screening device capable of 
separating out those children who warrant a 
“second look” neurologically. 


METHOD 


Since it was possible to identify some homogeneity 
among various presenting complaints of child psy- 
ciatric patients, criterion groups were designated ac- 
cording to whether or not brain damage was sus 
pected: (a) those with suspected emotional disturb 
ance without suspected brain damage and (b) those 
with suspected brain damage, with or without emo 
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tional disturbance. Notations regarding .any suspected 
organicity appeared in the hospital chart in the form 
of a physician’s referral specifically for neurological 
study; in the form of behavioral complaints from 
physician or schools related by them to brain injury 
by inference, such as extreme hyperactivity, short 
attention span, impulsivity, etc.; and in the form 
of notes regarding questionable or known seizures, or 
notes as to sequelae from illnesses or trauma. 

The emotionally disturbed group consisted of pa- 
tients in whom brain injury was not mentioned as 
suspect by any person in the chain of individuals 
from the original source of referral through the final 
examining physicians in hospital. Presenting com- 
plaints common to this group were fears, obsessions, 
lying, stealing, truancy, pathological shyness or with- 
drawal, allergies, and somatic complaints. No child 
whose presenting complaint was mental retardation 
was included in either group. 

The resulting two groups were matched relative to 
their socioeconomic level in proportions according to 
father’s occupation using the Minnesota Scale of 
Parental Occupation. The lower age limit was 10 (9 
years 10 months), and the upper age limit 12 (12 
years 11 months). A limit of 1 year’s age span would 
have been most desirable to insure homogeneity in 
this regard but sufficient numbers were not available 
within such a narrow limitation. Children aged 10 
were chosen since normative data previously obtained 
by Quast (1957) indicated absence of most Bender 


TABLE 1 


RELATION OF BENDER ATTRIBUTES TO CRITERION GrRouPS (SUSPECTED 
Bratn DAMAGED AND SUSPECTED EMOTIONALLY DISTURBED) 


deviations by that age. A number of “organic signs” 
on the Bender seen even in adult records occur as 
normal developmental phenomena up to age 8 but 
usually not beyond 10. Thus, such an age selection 
controlled developmental deviation to large extent. 

It was impossible to make the sex ratio comparable 
to the population at large since the usual clinic ratio 
averaged two boys to one girl. The final selection of 
patients was considered to be a representative sam- 
ple of the clinic population. 

The mean IQ for the emotionally disturbed group 
was 99.5, with a standard deviation of 15.72. Mean 
IQ for the suspected brain damage group was 81.7, 
with a standard deviation of 16.55. The difference 
in mean IQ was a reliable one with a ¢ of 5.57, 
p = < .0001; however, previous data (Quast, 1957) 
would indicate IQ per se not to be a significant de- 
terminant in Bender performance. Also, the mean 
mental age of the brain damage suspects (9 years 1 
month; SD, 2 years 1 month) exceeded the cutoff 
mental age 8, below which deviations occurred as 
normal developmental phenomena. 

The two groups were similar in that they could be 
considered “typical” child psychiatric patients. They 
shared common referral sources and represented a 
wide range of problems. 

The Benders were administered by the writer, or 
by advanced graduate students during their hos- 
pital internship in clinical psychology. All tests were 
scored according to the Peek-Quast system (1951) 


© Brain Damaged % Emotional Relation of Level % Normative* 

Suspects Suspects Sign to Criterion of Ages 10 & 12 

Showing Sign Showing Sign Groups in Signifi- Showing Sign 

Attribute or sign (N = 50) (N = 50) Terms of @ cance (N = 100) 
Scalloping 24 2 327 01 1 
Dashing 24 4 .288 1 6 
Perseveration 48 12 393 01 14 
Rotations, 2 or more, +2 or 

+3, not Figures a, 3 32 Ps 399 01 6 
Reversal 26 0 387 01 3 
Confabulation 26 0 387 01 1 
Angulation, +2 40 2 483 01 3 
Mixed orientation 36 36 000 - 11 
Card turning, inversion 24 12 .156 0 
Major distortion 56 4 567 01 7 
Erasures, absence of 50 26 .247 05 42 
Excessive pressure 46 50 — .040 ~ 78 
Separation, +2 20 0 333 01 6 
Flattening 24 22 024 28 
Exaggeration 64 42 .220 05 39 
Line substitution 0 0 .000 0 


Slope 
Global clinical impression 


® Presented for convenience in comparison. 
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TABLE 2 


INTERCORRELATIONS BETWEEN ATTRIBUTES CHARACTERISTIC OF THE 
SuspecTtep Brartn DAMAGE SAMPLE 


10 


Note.—Correlations are phi coefficients. 1 = Scalloping, 2 = Dashing, 3 = Perseveration, 4 = Rotation, 5 = Reversal, 
6 = Confabulation, 7 = Angulation, 8 = Major Distortion, 9 = Separation, 10 = Slope. 


by the writer without knowledge of the patient's 
name or classification. A global judgment was made 
on appraisal of the record as a whole, on a no brain 
damage versus brain damage dichotomy, prior to 
scoring. 

Analysis of the scored records consisted of com- 
paring the two groups for the presence or absence of 
a number of Bender attributes or signs, found in the 
examiner’s experience to be most common in the 
records of brain injured children and adults. An 
a priori selection was made of 17 attributes most 
likely to discriminate the two groups. 

Item validity was obtained by the use of Jurgen- 
sen’s tables (1947) for the determination of phi co- 
efficients, and levels of significance were calculated. 
For those attributes showing the best discrimination 
(.01 level), intercorrelations were also calculated. 

The scores were assumed to be dichotomous so the 
correlation method selected was the phi coefficient. A 
test of the null hypothesis can be made through phi’s 
relationship to chi square. Chi square = N¢%*. If chi 
square is significant in a four-fold table, the corre- 
sponding phi is significant at the same level. Spe- 
cifically, where N = 100, chi square is significant at 
the .01 level if it is at least 6.6, and at the .05 level 
if it is at least 3.8. 


RESULTS AND DIscUSSION 


The results of the comparisons between 
the performances of the suspected brain dam- 
aged and the suspected emotionally disturbed 
groups are presented in Table 1 which ex- 
presses the relation of the Bender attributes 
to the criterion groups in terms of phi coeffi- 
cients and the levels of significance. Norma- 
tive data (Quast, 1957) on the 10- and 12- 
year olds combined are presented for com- 
parison, 

It is seen that 10 of the attributes separate 


the two groups at the .01 level of significance. 
Two others showed significant differences be- 
tween the groups at the .05 level. The fact 
that clinical impression of the record as a 
whole separated the groups best is not sur- 
prising when it is learned that the suspected 
brain damage group averaged 3.6 attributes 
per patient as contrasted with a mean of .60 
for the suspected emotionally disturbed. False 
positive Bender signs in the suspected emo- 
tionally disturbed occurred to a significant 
extent on only 1 of the 10 discriminating at- 
tributes. This attribute, Slope, while occur- 
ring in two-thirds of the brain injury suspects, 
occurred in one-third of the suspected emo- 
tionally disturbed. The busy clinician should 
note that the most discriminating signs were 
generally also the easiest to score. These data 
suggest that a cutting score based on the 
signs which give optimal discrimination, pos- 
sibly weighting the better signs, could be de- 
veloped through a cross-validation study. 
When intercorrelations were calculated for 
these 10 attributes (Table 2) it was found 
that in general they appeared to show little 
intercorrelation with one another. Because of 
interest in the underlying strength of relation- 
ship between signs rather than in making pre- 
dictions from one sign to another, and rec- 
ognizing the restriction which reduction of 
frequencies to 2 * 2 tables places upon phi 
coefficients, the obtained phi’s should be in- 
terpreted for size in light of the maximal phi 
coefficients possible. About 8 of the 45 inter- 
correlations were appreciably affected by such 
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1 —.20 +.43 —.06 +.06 00 +.30 +.12 +.16 00 a 
2 +.19 —19 00 +.06 +.14 00 00 +.13 
3 +,.02 +.42 +.28 +.03 +.12 —.07 
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considerations (for example: ¢is3max = -65, 
when obtained ¢ = .43; ¢36max = -61, when 
obtained ¢ = .42). 

These data are consistent with the hypothe- 
sis that the Bender may be tapping a variety 
of defects which may exist, in combination or 
separately, in an individual patient. If this is 
true and each sign is valid, and the probabil- 
ity of their occurring in patterns is low (low 
correlation), then a count of critical signs 
takes on considerable significance for diag- 
nostic purposes. Validity of a total score is 
much enhanced when separate parts or items 
are valid individually but modestly intercor- 
related. However, a combinatory sign ap- 
proach may be ultimately less helpful in terms 
of explanation of the deviations than a fur- 
ther exploration of a particular sign. 

For example, while the attribute Persevera- 
tion can logically be linked with other known 
perseverative behaviors and thinking of the 
brain injured person, Scalloping, Rotations, 
Angulation, and/or others, may be linked with 
defects hitherto unexplored. Attention to in- 
. dividual signs may also throw light on the 
problem of whether a more fundamental, gen- 
eral disturbance (for example, “integration’’) 
may be involved which is expressing itself in 
a variety of ways, rather than associating 
signs with localized brain areas. That a more 
central, general, integrative phenomenon is op- 
erant is suggested in this sample by the find- 
ings that some visual motor disturbance was 
common to the group despite the wide varia- 
tion of central nervous system complaints. 

Seriously ill patients have at times been 
able to describe the reasons for their devia- 
tions, and several of their explanations have 
been illuminating. With some consistency pa- 
tients in acute neurologic states have described 
a conservation of energy as the main reason 
for certain deviations. In verticalizing hori- 
zontally oriented figures, for example, pa- 
tients have stated it “easier” to make flexor 
rather than extensor motions. Other acutely ill 
patients have described with some distress 
their inability to make an angle in one direc- 
tion while demonstrating facility with angula- 
tion in another. That this difficulty in lateral- 
ity has an appropriate cerebral morphological 
correlate or that defects in circumscribed cor- 
tical sensory or motor areas are at least “in- 
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strumental” prerequisites, would seem justi- 
fied since peripheral muscle mechanics offer 
inadequate explanation in most cases. 


SUMMARY 


The validity of certain Bender deviations 
as indicators of cerebral dysfunction was ex- 
amined in 100 child psychiatric patients, aged 
10 to 12. On the basis of their presenting 
complaints, patients were divided into two 
groups designated as suspected brain dam- 
aged and as suspected emotionally disturbed. 
Mental age and socioeconomic status factors 
were controlled. 

An a priori selection of 17 attributes nor- 
mally not occurring after age 8 showed 10 
of these to differentiate the two groups at 
the .01 level. False positive “organic” signs 
in the suspected emotionally disturbed oc- 
curred in but one discriminating attribute. In- 
tercorrelations between the 10 attributes hav- 
ing the best discriminatory power showed 
them, in this sample, to have in general low 
positive correlations with one another. 

These data suggest that the practicing cli- 
nician would do well, when these 10 attributes 
appear in the records of his patients, to con- 
sider “neuronic” rather than neurotic etiology 
for the child’s behavior. For the researcher, it 
is hoped that these findings may serve as a 
building block toward spelling out the hetero- 
geneous nature of that group now referred to 
categorically as brain damaged. 
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A COMPARISON BETWEEN HYPNOTICALLY INDUCED 
AGE REGRESSIONS AND WAKING STORIES 
TO TAT CARDS: 


A PRELIMINARY REPORT 


JOSEPH REYHER 
Michigan State University 


Estimating the degree to which a client in 
therapy has worked through important areas 
of conflict often imposes a difficult judgmen- 
tal task upon the therapist. A method for 
increasing the objectivity of this task was 
developed in connection with an exploratory 
investigation involving the comparison of 
hypnotically induced age regressions to TAT 
cards as stimuli and subsequent waking stories 
to the same cards. The original intent of the 
study was to develop a procedure for reduc- 
ing the artificiality of hypnotically induced 
conflict so the results of such data could be 
more meaningfully interpreted; however, the 
immediate diagnostic significance of the data 
overshadowed some of the more long range 
experimental goals. It was found that differ- 
ences between the content of the hypnotic 
and waking reactions significantly extended 
diagnostic impressions, often reflected unre- 
solved areas of conflict, were a useful psycho- 
therapeutic aid, and provided valuable in- 
sights into hypnosis itself. Psychoanalytic 
theory served as the frame of reference for 
both the psychotherapy and the experimental 
design. 

METHOD 


Subjects 


Five subjects were used. Because of the possibility 
that adverse posthypnotic reactions might occur as a 
result of the stimulation of a subject’s emotional con- 
flicts, only clients were used who were in or about 
to engage in psychotherapy. Furthermore, they were 
given some control over their emotional reactions in 
the waking state in order to reduce the possibility of 
premature activation of problems which involved 
severe conflict. Four of the five subjects were able 
to experience a complete posthypnotic amnesia. The 
one exception usually experienced only partial am- 
nesia. Three of the subjects were characterized by 
predominantly neurotic reactions; the other two 
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were characterized by predominantly psychotic re- 
actions, but they were sufficiently intact to function 
without hospitalization. 


Procedure 


Since the same TAT cards were not suitable for 
both sexes, two sets of 17 cards each were assem- 
bled.2 Eight of the cards were common to both sets. 
A random procedure was used to determine, for each 
subject, the selection of 10 cards from the appropri- 
ate set, the designation of the cards as either con- 
flictual (c) or neutral (n), and the order in which 
the cards were presented.” 

Conflict cards. While hypnotized, the subject was 
told that he would be asked to look at a picture 
that would activate disturbing emotions. He then 
was asked to open his eyes and look at the card. 
After about 10 seconds, he was instructed to close 
his eyes and to go back in time to a period when 
these emotions were very difficult to manage. He was 
then asked to verbalize his experience. 

Neutral cards. The instructions were the same as 
for the c-cards except that the subject was told that 
the emotions to be experienced would not be dis- 
turbing but, nevertheless, would be meaningful. 

Posthypnotic suggestions. All subjects were given 
the following instructions: 


Sometime later, when you are awake, Dr. A. will 

give you the same pictures that I gave you ear- 

lier, and he will ask you to tell stories about them. 

Each picture will stir up the same feelings, emo- 

tions, and ideas that it did before, but you will 

1 The cards unique to the female set were 3GF, 
18GF, 13G, 12F, 9GF, 7GF, 8GF, 17GF, and 2; the 
cards unique to the male set were 8BM, 18BM, 20, 
14, 6BM, 7BM, 12BM, 13B, and 1; the cards which 
were common to both sets were 5, 10, 4, 3BM, 12BG, 
6GF, 13MF, and 17BM. Cards 15, 16, 11, 9BM, and 
19 were not used. 

2 There was one restriction on the random pro- 
cedure which was used for determining the con- 
flictual and neutral designations of the cards. The 
structure of card 13;MF was too extreme in a con- 
flictual direction to risk giving it a neutral desig- 
nation. 
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either reveal them directly or indirectly in the 
stories that you tell. 


The last sentence in these instructions was intended 
to give the subject control over the nature of what 
would be consciously experienced in order to reduce 
the possibility of a traumatic reaction to the pre- 
mature recognition of repressed material. 


Waking condition. The subjects were given the 
cards by Dr. A. with standard instructions. 


RESULTS 


All subjects produced stories in the waking 
state that varied in the extent to which they 
corresponded with the hypnotic reactions to 
the same cards. In order to evaluate objec- 
tively the degree of similarity between the two 
sets of data, the reactions to each card for the 
two conditions were compared in terms of 
three of the most obvious dimensions of simi- 
larity which repression may influence: char- 
acters, situations, and affective-motivational 
state. The degree of similarity for characters 
(C) and situations (S) was assessed by a four- 
point rating scale with the following descrip- 
tive labels and numerical values: personalized 
(O), congruent (1), indeterminant (2), and 
different (3). The affective-motivational state 
(AM) was quantified by counting the expres- 
sions of intentions, needs, and affects by the 
subject in the hypnotic condition (H) and 
by the corresponding character in the post- 
hypnotic condition (PH). The AM units in 
the latter condition were classified as either 
the same (PH,) or as different (PH) from 
the hypnotic condition. Each AM unit was 
counted only once in each of the two condi- 
tions, regardless of how often it may have 
appeared. 


A difference score (D) for each card was 
devised: 


D=C+S+AM 


where: 
AM = =H — PH, + 2=PHa 


If no changes occur in the waking reactions 
to a card, D equals zero. As changes occur in 
the waking state, PH, decreases and PH, in- 
creases. The term PH, is weighted by a factor 
of 2 because it represents a transformation of 
affect, which is considered to be a more com- 
plicated process than simple repression. 

The D scores for the five subjects ranged 
from 5-16, 4-15, 4-16, 6-13, and 5-23. Low 


and high scores were arbitrarily defined as 
those which were in the first and fourth quar- 
tiles, respectively. 

An estimate of the interrater reliability for 
two raters was obtained from rank order co- 
efficients of correlation between D scores for 
two of the protocols that were selected ran- 
domly. Coefficients of .74 and .75 were ob- 
tained and were considered to be sufficiently 
large for research purposes. 

The following protocols were selected in or- 
der to illustrate the type of material that was 
elicited by the c- and n-cards which had high 
D scores. Subjects A and B were similar inso- 
far as initial psychodynamic impressions in- 
dicated that they both had unresolved Oedipal 
conflicts, nonspecific sexual fantasies concern- 
ing an older brother, and confused psycho- 
sexual identifications which appeared to be 


related to parental preference for an older 
brother. 


Subject A, Card 4 (c-Instructions) 


Hypnotic reactions. He reminds me so much of 
daddy. I don’t know—he just seems always to have 
some barrier—something between us. So many times 


I'd like to tell him I love him or give him a big kiss * 


or something, but I can’t. I’m afraid he wouldn’t— 
I don’t know—I don’t think he would push me aside 
or anything but I just never have told him how I 
liked him for a father and all the things he’s done 
for me and I can’t get close to him and I'd like to 
be able to but he just—I don’t know why—maybe 
he’s afraid of women, but I want to so bad all 
through my life. When I was little, I would sit on 
his knee and play with his hands—’cause they were 
bigger than mine, and I had so much fun that way. 
When I got older and wanted him to still treat me 
like that or show that he still liked me, he just 
wouldn’t, and I almost hated him for it—’cause other 
girls’ dads kissed their daughters and took an inter- 
est in them. It seemed he just let me go my own way 
and let mother take care of me and I didn’t want her 
to—I wanted him to. He just wouldn’t, but yet I 
would—I do things for him. Like I’d go golfing with 
him, and I'd drink some beer with him during the 
summer sometimes on a hot night—to please him, but 
that’s all I did it for was to please him, not ’cause 
I liked it. But I can’t—it seems like—I wish I could. 

Waking reactions. Oh, ah-h, let’s say, I believe 
they are married—these two people—and he’s the 
kind of a person that would love to tell his wife 
how he feels about her but he can’t quite and she al- 
most craves it. She wants him to tell her all those 
little sweet nothings and he finds it hard to do so 
and therefore—they don’t understand one another 
and she—tries to console him, but she can’t because 
she doesn’t understand it. 
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Age Regressions and Waking Stories in TAT 


These reactions reinforced the initial psy- 
chodynamic impressions concerning sexual in- 
clinations toward her father. 


Subject A, Card 8GF (n-Instructions) 


Hypnotic reactions. ’'m watching my brother and 
he’s painting, and I admire him so much. I think he’s 
kind of like a genius or something—the way he can 
create pictures out of paint. I couldn’t do that and 
I'm kind of jealous of him. Still, he was my brother 
and I thought he would make something of himself 
and IJ could be his sister. He used to make the most 
beautiful scenes. He couldn’t paint people, just 
scenery and things. They were so pretty. 

Waking reactions. This girl really has a dreamy 
look in her eyes. She is in a classroom and she 
admires her—oh—English teacher very much. She 
thinks to herself that someday she will be just ex- 
actly like her. Is that short enough? That is, I 
mean, does it make any difference how long or short 
they are? 


Her underlying attraction to her brother 
becomes very clear. Substitution of the word 
“wife” for her illogical use of “sister” would 
be more congruent with unacceptable fantasies 
concerning her attraction to her brother. At 
no previous time had there been any evidence 
that homosexual tendencies might be involved. 
The waking reactions indicated that these 
tendencies may be in the service of defense 
against incestuous fantasies. The obvious de- 
fensive disturbances in the waking reactions 
reinforce these impressions. 


Subject B, Card 7GF (c-Instructions) 


Hypnotic reactions. I am thinking about two things 
and I can’t separate them. (What are they?) It seems 
that I’m sitting on my mother’s lap and she is telling 
me about having my first menstrual period and I 
also seem to think at the same time about being tied 
in a chair. I can’t—I can’t quite remember, I know 
I used to get tied in the chair when I was bad and 
I don’t know why. 

Waking reactions. The mother is reading a story 
to the little girl—and the little girl is holding a doll 
in her arms—and tells her that someday she might 
become a mother—and be the kind of a person that 
her mother is reading about in the book. (How is 
she feeling?) She feels as if she would like to grow 
up and it seems a long way off. (How will it end for 
her?) She will grow up and have a baby. (Laughs.) 


In subsequent therapeutic sessions, succes- 
sive dreams were induced in the subject re- 
garding the emotions and thoughts behind the 
image of sitting on her mother’s lap learning 
about menstruation and the image of being 
tied to a chair. The induced dreams to men- 
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struation were related to psychosexual con- 
fusion which progressed to an abreaction of 
her anxiety regarding her wish to be a boy. 
The image of being tied to a chair was re- 
lated to a traumatic early childhood experi- 
ence in which she heard a voice, while she 
was in bed, telling her to kill her mother. 


Subject B, Card 17GF (n-Instructions) 


Hypnotic reactions. My daddy and I used to ride 
on bicycles through the—part of the canal—and it 
was very enjoyable for me—by the water and a 
bridge. I always enjoyed it—riding on a bicycle there 
with my dad, because we always liked the water— 
threw stones in it. (Is there anything else?) I—used 
to like it, that’s all. 

Waking reactions. It seems to be a granary of some 
sort—there are sacks of grain on the ground. There’s 
a girl on a bridge. In the background is the granary 
and she would like to leave it, but she can’t make up 
her mind whether she wants to or not. I think she 
would like to be on a boat and go away. (What will 
finally happen?) I don’t think she will. (Can you 
make up a story about why she is torn between 
leaving and staying?) She feels that the people in 
the house will miss her if she goes away. She would 
like to go somewhere else. 


The material produced in the hypnotic re- 
action was related to the many dreams she 
had concerning water. Successive dream in- 
duction produced the reliving of an experi- 
ence in which she was riding on her father’s 
back in the water while having thoughts that 
she wanted to possess him completely. 

Subject C was a tense, conscientious, mar- 
ried female who was successfully engaged in 
one of the medical specialties. Marital difficul- 
ties constituted the presenting problem. 


Subject C, Card 17GF (c-Instructions) 


Hypnotic reactions. This is my fear of high places 
for myself or to see someone else in a high place, 
even in the movies or TV, when they’re out on a 
ledge. When we were at the Grand Canyon, other 
people would lie on their stomachs and would look 
over the edge. Why, I don’t know, unless heights 
symbolized to me jumping and suicide—trying to 
stay away from the situation or avoid it. Well, it’s 
not so much fear of jumping myself as an accident, 
of someone not meaning to but accidentally pushing 
someone over or falling. Well intentions and yet un- 
avoidable. I have thought briefly in terms of suicide 
but whether I would ever be able to commit suicide, 
I decided that I would never be able to. (Anything 
else?) No, I don’t believe so. 

Waking reactions. A woman on a high bridge feel- - 
ing very despairing, uh—considering jumping. She 
feels that things haven't gone as they should but she 
isn’t going to jump. 
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The italicized portions reveal the underly- 
ing hostile impulses which were betrayed in 
terms of rather direct verbal representations. 


The attempts to cover up were ineffectual, 


and defensive reactions due to the marked 
breakthrough of murderous impulses were not 
present. Any such slips in the waking state 
probably would have activated a host of au- 
tonomic reactions and vigorous defensive ac- 
tivity. The brief and impoverished waking 
reaction supports this interpretation. No high 
D scores were associated with n-cards. 


DIscussIoN 


In view of the omnipresent possibility of 
artifact due to the motivation of hypnotized 
subjects to please the experimenter by behav- 
ing in a manner consistent with what they 
believe is expected of them, caution must be 
exercised in the interpretation of the results. 
In terms of the procedure, it would seem 
likely that subjects would perceive that the 
instructions to the c-cards were of special in- 
terest. If subjects had surmised correctly that 
the experimenter had expected greater differ- 
ences between the hypnotic and waking reac- 
tions to c-cards than to n-cards, there should 
have been higher D scores associated with the 
former. Although this was true, the material 
that was produced was more closely involved 
with potent areas of conflict than material 
which had come up at that point in therapy. 
Aside from whatever role a desire to please 
may have played in producing the material, 
it proved to be extremely useful in achieving 
a more sharply defined and delineated psycho- 
dynamic impression. 

The enhanced clarity of the psychodynam- 
ics to c-cards with high D scores supports 
White’s (1941) observation that hypnosis is 


an altered state of awareness in which there is . 


a contracted frame of reference and lack of 
such critical functions as a sense of humor 
and self-consciousness. It may be that a re- 
duction in these functions proportionately re- 
duces the anxiety producing potential of re- 
pressed intrapsychic stimuli which enables 
them to become more clearly represented in 
awareness. The most perplexing and unex- 
pected data, however, were the high D scores 
for a number of the n-cards to which the sub- 
ject had experienced positive affect in the hyp- 
notic reactions. Previous psychodynamic im- 
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pressions indicated that this was not a case 
of simple forgetting, because the high D scores 
involved material related to the most central 
areas of conflict. Unlike the c-cards, material 
of equal or greater anxiety producing poten- 
tial was reacted to with positive affect. The 
two sets of instructions appeared to elicit dif- 
ferent kinds of unconscious processes or dif- 
ferent facets of the subjects’ conflicts. 

The observed differences in the reactions to 
c- and n-cards with high D scores are prob- 
ably related to the difference in the instruc- 
tions for these cards. The c-instructions re- 
quired the subject to react with disturbing 
affect to whatever was brought to mind by the 
card. In response to this, the subject seemed 
to recall some previous conscious experience 
that was distressing to him. When this reac- 
tion was experienced, underlying repressed 
aspects of this material became more clearly 
represented. The high D score, therefore, 
seems to be a function of the potential threat 
of perceiving this same material with the en- 
hanced critical functions of the waking state. 

The n-instructions merely required the sub- 
ject to respond with reactions that were not 
disturbing but, nevertheless, meaningful. Un- 
like the c-instructions, the n-instructions made 
a comparatively nonspecific request, insofar as 
the subjects’ reactions to the card were not 
restricted to one class of affect. It may be 
that the positive reactions are a function of 
hypnosis as an altered state of awareness, and 
that the same material would activate anxiety 
in the waking state. This interpretation is con- 
sistent with psychoanalytic theory in that 
anxiety and defense are produced by repressed 
material that is striving for an outlet and 
gratification. Again, the high D score prob- 
ably is a function of the enhanced critical 
abilities of the waking state in which the per- 
ception of such material would activate a 
prohibitive degree of anxiety. 

In order to pursue the implications of the 
above interpretation of the high D scores to 
n-cards, we have been instructing clients un- 
der hypnosis to let their minds wander as they 
do just prior to dropping off into a deep sleep, 
with images, dream-like thoughts, and feelings 
coming without any conscious effort. The 
client is not instructed to talk until he is 
awakened, at which time he is simply asked 
to talk about his reactions. The suspension of 
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conscious effort and talking are designed to 
reduce critical functions (secondary process) 
that would be activated by the need to com- 
municate. Although our experience with this 
method is still limited, the results have been 
very encouraging. When the spontaneous im- 
ages, fantasies, dreams, and memories which 
are produced become too transparent, a de- 
fensive posthypnotic amnesia may occur; 
however, subsequent hypnosis frequently can 
recover much of this material. 

Cards 8GF and 7GF for Subjects A and B, 
respectively, illustrate that alterations of af- 
fect and motivation can masquerade as health. 
Most of the waking reactions, however, con- 
tained some clues as to the presence of un- 
derlying conflicts, but the alterations often 
were so great that the nature of the actual 
conflict could not be readily inferred without 
benefit of the hypnotic reactions. The most 
productive method of analysis with these data 
was to consider the hypnotic and waking re- 
actions jointly. The hypnotic reactions were 
often closer to conflictual material, whereas 
the waking stories gave a better picture of de- 
fensive reactions. Thus, successive dream in- 
duction and a combined analysis of both types 
of TAT data present a better opportunity to 
assess the nature of the underlying anxiety 
producing processes and attendant defensive 
reactions at three different levels of psychic 
representation. 

Not only do high D scores provide valuable 
diagnostic information about the status of the 
subject’s conflicts and the degree to which he 
has worked them through, but they sometimes 
open up areas previously unknown to the 
therapist. One method of utilizing the high D 
scores in therapy is to give the subject the 
posthypnotic suggestion to recall all the age 
regressions, with the provision that he can 
forget what he does not want to remember at 
this time. If the age regressions to the cards 
with high D scores are not recalled by the 
subject in the waking state, further evidence 
of the dynamic significance of the material is 
obtained. These areas then become the foci 
of subsequent hypnoanalytic sessions. 

It is not surprising that the subject may re- 
member some of these age regressions in the 
waking state because the hypnotic material is 
still relatively disguised. If subsequent waking 
and hypnotic interviews are unproductive in 
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further uncovering and working through of 
the material denoted by a given high D score, 
specialized hypnoanalytic procedures can be 
used. One such procedure which is particu- 
larly well suited for this purpose is successive 
dream induction concerning the emotions and 
experiences that produced the content of the 
age regression. Examples 7GF and 17GF for 
Subject B illustrate this procedure. 


SUMMARY 


The clinical and experimental value of hyp- 
notic and waking reactions to TAT cards was 
investigated. Ten TAT cards were selected 
randomly to be conflictual or neutral for five 
psychotherapy clients who were capable of 
deep hypnosis. The conflict-inducing instruc- 
tions were intended to activate disturbing re- 
actions to TAT cards. This was followed by 
an age regression to a period earlier in life 
when the activated reactions were particularly 
difficult to manage. The nonconflict-inducing 
instructions were the same except that they 
were intended to activate nondisturbing, but 
meaningful, reactions. 

A difference score (D score) was devised in 
order to evaluate quantitatively differences in 
characters, situations, and affective-motiva- 
tional states between the hypnotic and wak- 
ing reactions to each card. Both conflict cards 
and neutral cards had high D scores which 
reflected areas of conflict that had not yet 
come up in therapy and areas that had not 
been worked through adequately. 

Neutral cards with high D scores, unlike 
conflict cards with high D scores, were asso- 
ciated with positive affect. It was concluded 
that the conflict cards activated previous con- 
scious, conflictual experiences which more 
clearly revealed underlying repressed mate- 
rial than the waking reactions to the same 
cards. The high D scores to the neutral cards 
were interpreted as evidence that hypnosis is 
an altered state of awareness in which uncon- 
scious drives tend to be perceived in terms of 
gratification rather than threat. A procedure 
was described for utilizing this material in 
psychotherapy. 
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Several investigators have suggested that 
the degree to which a person’s speech departs 
from its usual level of coherence and economy 
is a likely indicator of anxiety (Dibner, 1956; 
Eldred & Price, 1958; Mahl, 1956). The fol- 
lowing experiment was designed to evaluate 
the predictive validity of measures of such 
speech disruption as indicators of anxiety. 


MEASURES 


Both Mahl (1956) and Dibner (1956) de- 
scribe measures of speech disruption. The two 
measures appear similar and produce highly 
correlated total scores: the median for 15 
cases was .91 (Krause, 196la). We felt that 
these measures could be improved with regard 
to their “psychological meaningfulness,” reli- 
ability, and range of sensitivity. In the first 
place there was a good deal of overlap be- 
tween the two measures. Mahl’s “sentence 
incompletion,” “repetition,” and “stutter” 
(plus “omission”) categories resemble Dib- 
ner’s “unfinished sentence,” “repeating words 
or phrases,” and “stuttering or unfinished 
words,” respectively. We attempted to re- 
group the various speech disruptions into 
categories which referred more uniformly to 
the expression of complete thoughts and which 
might be interpreted in terms of the function 
served by the disruption in the flow of speech. 
To these ends we adopted Dibner’s “break,” 
Mahl’s “correction,” their common “unfin- 
ished” or “incomplete,” and pooled their 
“repetition” and “stutter” categories. Mahl’s 
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“tongue slip” and “omission” and Dibner’s 
“omission” (part of his “stutter” category) 
we combined into “distortion,” Dibner’s “I 
don’t know,” “sigh,” “laugh,” and “ques- 
tion,” as well as Mahl’s “intruding incoherent 
sound,” were pooled in our “intrusion” cate- 
gory. The Mahl “Ah” category we expanded 
as “procrastination.” The following set of de- 
scriptions specifies our categories of speech 
disruption. 


SPEECH DISRUPTIONS 


B (break). The continuity of one line of 
thought is broken by the intrusion of another. 
The break is usually grammatically disruptive 
but it is the interruption of an incompleted 
line of thought by another, different thought 
that is distinctive. There must be an actual 
shift of topic, so that the interrupted mate- 
rial seems to have been displaced from the 
speaker’s attention. Therefore a B is not 
scored when the interjected material is a com- 
mentary (even if rather anticipatory) upon 
or a correction of the interrupted material. 
Thrown-in statements, references to the in- 
terviewer, and questions can be breaks. 

C (correction). Something is stated and 
then corrected within the same sentence. It 
may be a word, phrase, or clause which the 
correction is to replace, but in all cases the 
subject of the corrected and correcting mate- 
rial seems to be the same. Thus, the correc- 
tion is usually one of pronunciation, empha- 
sis, grammar, form, degree, specificity, or fact. 
It must not modify nor explain but replace 
the corrected material. The subject’s concep- 
tion seems to be that he has said the wrong 
thing, has misspoken himself, and must replace 
his word or words. A clue to this is that both 
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corrected and correcting elements fit the con- 
text. 

F (fragment). A clause or sentence is left 
incomplete either in meaning or grammar. 
Whenever such a fragment occurs it is scored 
an F, even if it is apparently completely ex- 
pressed in another independent try. Thus, 
when a sentence is interrupted by a qualify- 
ing clause and then the interrupted portion is 
repeated or replaced, this portion is a frag- 
ment regardless of how similar are the origi- 
nal and resumptive materials. An intended 
clause interrupted and replaced by another is 
an F, although the larger sentence into which 
both the interrupted and interrupting clauses 
fit is not an F. Answers to questions must be 
judged as to whether they are adequately com- 
plete responses so not judged independently 
of the question. When the material following 
a possible F could be construed to complete 
it, continuity or discontinuity of intonation is 
a clue, for an F often involves a change be- 
tween the fragment and what follows it. 

D (distortion). Mistakes or distortions of 
proper speech occur involving either grammar 
or meaning. They may be unintended words, 
including neologisms and tongue slips, incor- 
rect words or grammatical errors which are 
not habitual, or mispronunciations. These 
distortions are recognized by their impro- 
priety, prima facie or as inferred from the 
context (often from the correcting material). 
Word omissions are scored Ds where without 
the ostensibly omitted word neither portion 
of the statement (before or after the omis- 
sion) would stand as a complete sentence or 
thought. 

R (repetition). Certain phrases, words, or 
parts of words are perseverated upon by repe- 
tition rather than prolongation. The flow of 
speech is impeded by redundant elements such 
as stutters, word or phrase repetitions, and 
changes between contracted and uncontracted 
forms. Repetitions share certain character- 
istics with the repeated material which dis- 
tinguish their perseverative quality. These 
characteristics may involve pitch, timbre, 
loudness. The paradigm for R is the stutter. 

P (procrastination). The speaker seems to 
procrastinate, to delay getting on to his next 
point. He does this by means of delaying 
sounds (such as “ah,” “um,” or prolonged 
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vowels), words (such as “well”), or phrases 
(‘How shall I say?” sometimes even “I don’t 
know”). That he is controlling his speech and 
thinking ahead distinguishes procrastinations 
conceptually from repetitions. Many phrases 
which also serve other functions may partake 
of procrastination. These include “let’s say, 
let’s see, as a matter of fact, that is, for ex- 
ample, so to speak, I mean,” etc. “Well” and 
“Oh” are to be scored as procrastinations un- 
less they clearly lack this quality, as when 
they are purely exclamatory. 

I (intrusion). A nonverbal sound intrudes 
upon the flow of speech, but it occurs mean- 
ingfully like a break, rather than as a mere 
supporting procrastinating or background 
noise. It may be a sigh, laugh, cough, throat 
clearing, deep breath. 

The reliability attained by two independent 
raters on the experimental material, rating 
from typescripts with the recording being 
played, was 88%. This represents exact agree- 
ment, i.e., the number of disruptions scored 
identically as to location and category di- 
vided by total disruptions scored. Contained 
in the denominator are the number of disrup- 
tions scored by one and not the other rater, 
scored differently by each, and scored identi- 
cally by each. Over our protocols this coeffi- 
cient ranged from a low of 71% to a high of 
93%, with an interquartile range of about five 
percentage points. One-fourth of the protocols 
were rated after a lapse of 1 month, but the 
interrater agreement did not diminish. In or- 
der that response scores might be comparable 
over individuals with different speech outputs, 
scores were defined as the number of instances 
of occurrence for a particular category di- 
vided by the total number of words in the 
response. 


EXPERIMENTAL DESIGN 


The crucial design problem in this study 
was to provide anxiety criteria against which 
to test our measures. We employed a two-part 
criterion for recognizing instances of anxiety 
and nonanxiety. First, we developed a set of 
descriptions of what seemed stressful and non- 
stressful situations (i.e., stimuli). Second, we 
questioned the subjects as to what their feel- 
ings were while they were describing their 
probable reactions in these situations. Those 
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verbal responses to the presumably stressful 
situations which were reported to involve “a 
major element of tension, uneasiness, or ap- 
prehension . . . fear or fright” or anxiety, 
and no other emotions, were used as criterion 
instances of anxiety; while the responses to 
presumably nonstressful situations which were 
reported to involve no such element or any 
other emotion, were used as criterion instances 
of nonanxiety. We set aside, as irrelevant for 
validation purposes, the data on responses in 
which a stressor invoked no report of a fear 
or anxiety feeling, a nonstressor did invoke 
such a report or either invoked the report of 
some other emotion. 

These criteria were used because they rep- 
resented, in our opinion, the best evidence 
of anxiety that could be collected under 
the circumstances. The argument upon which 
this opinion rests is presented in detail else- 
where (Krause, 1961b), also see Grinker, 
Korchin, Basowitz, Hamburg, Sabshin, Persky, 
Chevalier, and Board (1956). Briefly, certain 
stimuli are on the face of them more stress- 
ful than others. Our confidence that anxiety 
is produced by them is greater than that it 
is produced by apparently innocuous stimuli. 
Our rationale for using the introspective re- 
ports, the second part of our criterion, is that 
one cannot feel afraid or anxious and yet not 
be subject to this emotion, for the feelings are 
sufficient evidence of the emotion’s presence. 
It is possible, however, for one to be anxious 
without feeling anxious, thus if a nonstressor 
which induces no feeling of anxiety does pro- 
duce speech disruption, the effects of predic- 
tive invalidity and “unconscious anxiety” will 
be confounded. The difficulty this involves 
for us is that we may get considerable speech 
disruption on trials classified as nonanxiety 
trials. This would tend to diminish the differ- 
ence in speech disruption between these con- 
trol trials and the experimental (anxiety) 
trials, thus decreasing the sensitivity of our 
experiment. 

The experiment was performed, ostensibly, 
as a survey of how people react in disasters, 
under the title Disaster Relief Research. The 
subjects were recruited by means of a letter 
aimed at arousing both their interest and 
public spirit. It was sent to a systematic ran- 
dom sample (Cochran, 1953, pp. 160-168) of 
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University of Michigan students drawn from 
the summer directory. Only 19 of the 60 re- 
cipients volunteered to act as subjects, but we 
have no reason to believe our results biased by 
the subjects’ self-selection. 

Reinforcing the impression created by the 
letter, an introduction was given to each sub- 
ject before the experimental session. The study 
was presented as an attempt to discover what 
public reaction to disasters would be. We 
stressed the practical value of such knowl- 
edge, its scarcity, and the subject’s ability to 
provide it. The interviewer endeavored to 
make the session a collaboration in which the 
subject did his best to place himself in the 
situations described and spontaneously report 
his reactions, while the interviewer asked for 
elaborations or clarifications. This “cover” 
was essential to promote the subject’s involve- 
ment, without which the stressors would be 
ineffective, and his honesty in reporting feel- 
ings, without which we could mot assert any 
correspondence between feelings and reported 
feelings. The control achieved by our cover, 
like any control upon which it is infeasible 
to collect independent data as to its efficacy, 
is open to question, of course, but we have no 
reason to suspect that the subjects did not 
believe and report as we intended. Their in- 
volvement is evident in the recordings of their 
sessions. 

The greatest danger of subjects’ dishonesty 
biasing our results is in their possible tend- 
ency to report conventional, socially accept- 
able feelings. We were especially concerned 
about reports of anxiety feelings’ presence or 
absence in response to what the subjects sup- 
posed the interviewer would expect, rather 
than to the subjects’ actual feelings. The 
situations were not ambiguous within pairs as 
to the relative degree of danger, discomfort, or 
emotional arousal supposed to be involved.? 


2 The existence of such a social acceptability re- 
sponse-set is questionable here, since 58 out of 190 
reports of feelings (i.e., from 1 to 9 for each subject) 
were unconventional in this sense. 

One collateral finding is interesting in this regard. 
The unconventional responses of the 11 females in 
our sample show a surprising regularity. Those that 
scored over 12 on a 50-item sample of the Manifest 
Anxiety scale err only in reporting anxiety to non- 
stressors, while those scoring under 12 err only in re- 
porting nonanxiety to stressors. The degree of this 
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This would tend to give us convervative re- 
sults in the predictability of anxiety by speech 
disruption, i.e., any positive findings will tend 
to be understated. 

We selected 10 pairs of situations, a stressor 
and a nonstressor in each. They all had some 
relevance to disasters and ranged from a first 
aid training proposal and Conelrad, through 
frustrated attempts to help others, to situa- 
tions of great personal danger. They were 
paired in an attempt to minimize the differ- 
ences, other than stress, between each stressor 
and its nonstressor mate. Thus, their posi- 
tions in the series of 20 were contiguous and 
the degree and manner in which the response 
was structured by the situation and probe 
questions were equated within pairs. A greater 
similarity of within pair subject matter would 
have been desirable as well, but for the greater 
danger of overlap between the emotional ef- 
fects of stressor and nonstressor. The inter- 
pair order was arranged so that the subjects 
would not perceive a stressor-nonstressor pat- 
tern and so that the apparently more intense 
stressors did not have a carryover effect on 
other stressors. 

In summary: We exposed each of 19 sub- 
jects to 10 pairs of stimuli presented in a 
standard order. The subjects were under the 
impression that they were participating in a 
survey of how people might be expected to 
feel and react during disasters. Those of their 
pairs of responses which met our validity cri- 
teria were studied in terms of a measure of 
speech disruption for the relationship between 
anxiety and this measure. 


RESULTS 


Thirty-seven pairs of responses over 13 
subjects (from a total of 190 pairs over 19 
subjects) were appropriate for study. We were 
relatively ineffective in controlling the stress- 
fulness of our stimuli, because we lacked prior 
knowledge of the subjects and used a stand- 
ard set of stimuli for all subjects. Some recent 
work of Lazarus (1960) may prove helpful in 
the future selection of stimuli. 

The seven categories of speech disruption 
were individually scored for each of the 37 


“error” varies positively with MA scale score, r = 0.77 
(significant at the .01 level). 


TABLE 1 


SpeEecH DisruPTION CATEGORIES AND PERCENT Cor 
RECT, INCORRECT, AND INDETERMINATE 


Category 


Prediction I 
Correct 78* 
Incorrect 14 
Indeterminate 8 


* Significant at the .05 level, one-tailed. 


response pairs. If the sign of the difference 
(anxiety response score minus nonanxiety re- 
sponse score) was positive the category pre- 
dicted correctly for that pair. If the sign of 
the difference was negative, it predicted incor- 
rectly. And if there was no difference, it failed 
to discriminate. The differences for each cate- 
gory, alone and in combination with others, 
were tested for significance by the normal ap- 
proximation to the binomial for the differ- 
ence between two percentages. The results in 
Table 1 are the percentages obtained for each 
of seven categories viewed separately. 

By combining categories we can obtain a 
slightly higher percentage of correct predic- 
tion than afforded by any individual category 
alone. The best simple combination seems to 
be to use I, and where it fails to discriminate 
use F. This combination predicted correctly 
in 86% of the cases and incorrectly in 14% 
of the cases. 

Both Mahl and Dibner have used the sum 
of several categories for a speech disruption 
score. We can approximate their scores by 
combinations of our category scores. The 
Mahl non-Ah ratio (Mahl, 1956) is akin to 
the sum of all our categories excluding I and 
P, while Dibner’s Cue-Count 1 (Dibner, 
1956) resembles our sum minus only I. 

Despite the intended similarity between the 
Mahl non-Ah ratio and our sum of categories 
excluding I and P, it should be pointed out 
that our approximation of this ratio discrimi- 
nated correctly in only 57% of the cases. A 
somewhat higher figure was reported by Mahl 
and Kasl (1958), in a similar experiment, for 
the non-Ah ratio suggesting the possibility 
that the scoring criteria were not identical. 
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TABLE 2 


SomE COMBINATIONS OF SPEECH DisRUPTION CATE 
GORIES AND PERCENT CORRECT, AND INCORRECT, 
INDETERMINATE 


Combination 
Prediction Total Non-I Non-I & Non-P 
Correct 70* 62 57 
Incorrect 30 38 40 
Indeterminate 0 0 3 


* Significant at the .05 level, one tailed. 


It does not seem that the extra effort re- 
quired to score the categories other than I 
(with the possible exception of F) is war- 
ranted by their incremental effect upon va- 
lidity. It is interesting, however, that speech 
disruption categories as a whole are, when 
pooled, statistically significant indicators of 
anxiety. This suggests that speech disruption 
may be a useful (unitary) concept. 

In order to give the reader some basis for 
comparing the validities of the disruption 
measures with those of other verbal anxiety 
measures, we include the results on three 
other suggested measures for which it was fea- 
sible to score our data. These are number 
of words spoken in the subject’s response 
(Balkan & Masserman, |940), the verb-adjec- 
tive ratio (Balkan & Masserman, 1940), and 
the latency of the subject’s response (Benton, 
Hartman, & Sarason, 1955). The results, por- 
trayed in Table 3, emphasize the effectiveness 
of I (with or without F) as an indicator of 
anxiety. 


TABLE 3 


NONSPEECH DISRUPTION CATEGORIES AND PERCENT 
Correct, INCORRECT, AND INDETERMINATE 


Category 
Number of Verbs/ 

Prediction Words Adjectives Latency 
Correct 59 65 62 
Incorrect 41 35 24 
Indeterminate 0 0 14 
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DISCUSSION 


The sorts of explanations alternative to 
anxiety which may be offered to account for 
speech disruption depend upon the situation 
in which the subject is placed. If we ask him 
to describe vague, ambiguous, or half forgot- 
ten experiences, then procrastinations, correc- 
tions, and fragmented sentences may well be 
frequent. If the interviewer is rather reactively 
mobile in his facial expression, this may in- 
duce a good deal of repetition or correction 
by the subject. An overly friendly or humor- 
ous interviewer can multiply ‘intrusions of 
laughter, while one who nods his understand- 
ing anticipatorily may encourage distortions 
and fragmentation. If the subject does not 
perceive the situation as one calling for par- 
ticularly coherent, grammatical, task-oriented, 
and economical speech—as he might not if he 
did not consider his role a serious one or if 
the interviewer were an old friend—he may 
not try to achieve a high level of disruption- 
free speech. We attempted to avoid these 
sources of speech disruption by our choice of 
stimuli, instruction of the interviewer, and 
experimental cover. Furthermore, these influ- 
ences tend to affect the absolute level of 
speech disruption and not its variations over 
the several stimuli presented. 

There are, however, peculiarities of our ex- 
perimental situation which may limit the gen- 
eralizability of our major results. The subjects 
were asked, in effect, to play-act. There were 
no “real” dangers in the situation perceptible 
by the subjects but for becoming excessively 
distraught by playing too well or offending 
the interviewer by getting too involved in 
the play and thereby “revealing” themselves. 
These dangers might be relieved by maintain- 
ing or shifting into a role-alien (e.g., an ob- 
serving ego) viewpoint and so achieving some 
perspective or “emotional distance” from what 
one was saying or thinking. The predominant 
component of I, laughter, was, as we defined 
it, peculiarly well fitted as an indicator of 
such a stratagem. Where the subject became 
threatened, he may have laughed as an indi- 
cation of his lack of commitment to the on- 
going remarks, for one can most certainly dis- 
cern this subtle intrusive quality in much of 
the subjects’ laughter. This is not to say that 
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the subjects were not anxious then, but it does 
suggest that in a more thoroughly real or 
spontaneous situation (e.g., using sudden in- 
tense stimuli as stressors) intrusive laughter 
might be more rare, as Bs were in this situa- 
tion, and so fail as a predictor of anxiety. 


SUMMARY 


In order to assess the predictive validity of 
speech disruption as an indicator of transi- 
tory anxiety, we exposed 19 subjects to 10 
stressors and 10 nonstressors. The combina- 
tion of stressor and reported feelings of anx- 
iety was the criterion of anxiety’s presence 
and that of nonstressor and reported absence 
of anxiety feelings was the criterion of anx- 
iety’s absence. We found that intrusive non- 
verbal sounds, mainly laughs and sighs, were 
the most correct predictors. 
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Caseload statistics at Pittsburgh Child Guid- 
ance Center indicate that 28% of all families 
accepted for treatment terminate their con- 
tact unilaterally and at a time which the 
clinic staff considers premature. This experi- 
ence seems similar to that of other clinics, for 
Levitt (1958) states that some report a drop- 
out rate of more than 30%. Premature termi- 
nation is costly because as many as 15 hours 
of scarce professional time may have been 
spent in diagnostic study and related activi- 
ties before the family decides to discontinue 
the contact. This poses a practical problem. 
At the same time these families are of theo- 
retical interest because, though carefully se- 
lected, they are unable to make use of col- 
laborative treatment, the traditional approach 
of child guidance clinics. 

Some families seem to derive benefit even 
from a truncated contact (Inman, 1956) so 
that the professional time may not be wasted 
completely, but if potential terminators could 
be identified early in the proceedings, they 
might be provided with a modified service, 
more suited to their needs. The staff time thus 
freed could then be made available to other 
families better able to use the traditional child 
guidance approach. 

This study was designed to explore vari- 
ables which might help differentiate between 
terminators and remainers. A number of in- 
vestigators have attempted to identify charac- 
teristics of families who remain in treatment 
and of those who terminate prematurely. 
Levitt (1958) found that judgments of cli- 
nicians, made on the basis of diagnostic in- 
formation, did not identify “defectors” suc- 
cessfully; nor did judgments of motivation 
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for treatment and severity of symptoms dif- 
ferentiate between the two groups. 

Hofstein (1957) has pointed out that in a 
child guidance clinic, assessment of a child’s 
treatability must rest on the evaluation of 
the parents’ capacity to involve themselves in 
the treatment process and to work toward a 
change in their relations to each other as well 
as to their child. The important contribution 
of parental attitudes to continuance in child 
guidance treatment has also been stressed by 
Inman (1956) and Smigelsky (1949). 

Lake and Levinger (1960) found differences 
in parental attitudes when they compared 50 
continuers with 50 discontinuers. The parents 
of continuers tended to be more aware of the 
child’s disturbance and of their own con- 
tribution to it. They were more inclined to 
see the problem as something for which the 
family as a whole was responsible and ac- 
cepted that they themselves had to participate 
in finding a solution. They also displayed 
greater cooperation during interviews and 
tended to agree with the worker on the na- 
ture of the child’s disturbance. 

Levitt (1957) compared defectors with re- 
mainers on 61 variables and found 5 which 
seemed to differentiate between the two 
groups. The variables did not form a mean- 
ingful cluster, nor did there appear to be 
“any theoretical reason to expect them to be 
differentiating.” In addition, the probability 
of finding 5 analyses out of 61 significant at 
the .05 level is very nearly .25. These consid- 
erations led Levitt to ascribe his results to 
chance. The present study attempted to an- 
ticipate this dilemma by testing a priori pre- 
dictions made on the basis of logical and 
theoretical considerations. 
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METHOD 


This center prepares an IBM punch-card record at 
the time each case is closed. The record is coded on 
67 categories, ranging from objective baekground de- 
scriptions of the child and the family, to items deal- 
ing with developmental and health history, present- 
ing symptoms, diagnosis, and clinic procedures.1 The 
system includes every case closed since 1943 and, at 
the time of this investigation, contained 2,400 records. 
A study by Gilbert (1957) suggests that girls referred 
to child guidance clinics do not come from the same 
population as boys. Therefore, only boys’ records 
were used for the present analysis. There were 1,497 
boys among the cases. Terminators and remainers 
were drawn from this population. 

Terminators were defined as families who had en- 
tered treatment but discontinued therapy on their 
own decision before five treatment interviews with 
the child had taken place. A total of 107 cases met 
these criteria. 

Remainers were defined as families who had en- 
tered treatment, continued for more than 16 treat- 
ment interviews with the child, and terminated either 
by mutual or clinic decision. A total of 154 cases met 
these criteria. 

Of the 67 categories of information available on 
each case, 27 were chosen for analysis. Those cate- 
gories were selected in which the information was 
specific enough for research purposes and relevant to 
aspects of parental attitudes toward the clinic or 
toward the child. Specific predictions of the relations 
between the variables and the continuance dichotomy 
were made and recorded before analysis of the data. 
In four of the categories more than 1 prediction was 
made so that a total of 32 discrete predictions was 
put to test. 

Some predictions were based on results of studies 
reported in the literature (Affleck & Mednick, 1959; 
Lake & Levinger, 1960; Levitt, 1957; Rubenstein & 
Levitt, 1957; Tuckman & Lavell, 1959). Other pre- 
dictions were based on parents’ ability to participate 
in treatment, which seemed relevant on the basis of 
clinical experience. 


RESULTS 


Of the 32 predictions, 9 could not be tested 
because insufficient information precluded a 
meaningful analysis. These hypotheses related 
to infantile behavior, tomboy or effeminate 
behavior, father’s employment status, moth- 
er’s employment status, nature of family resi- 
dence, and three others related to diagnostic 
categories. Five of the 32 predictions were 
confirmed at better than the .02 level of sig- 


1 This system was instituted by William F. Finzer, 
Director of the center, and is maintained by Sallie 
Churchill. To them, and to David S. Lepson, who 
assisted with the analysis, the authors express their 
appreciation. 
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nificance. When 32 comparisons are possible, 
the probability of obtaining five differences, 
significant below the .02 level, by chance is 
less than .0001 (CR = 5.51). This calcula- 
tion is based on the approximation described 
by Brozek and Tiede (1952). 

For the remaining 18 predictions the re- 
sults failed to reach a satisfactory level of sta- 
tistical significance, although the differences 
were in the predicted direction in all but five 
of the comparisons. 


Confirmed Predictions 


Compared to terminators, remainers have a 
greater proportion of histories of develop- 
mental difficulties (complications in weaning, 
toilet training, delayed speech, reduced social 
responsiveness) (x? = 10.12, p < .01).? 

The classification “Unusual Behavior” (con- 
fusion, disorientation, panic reactions, unpre- 
dictable, meaningless, self-destructive 
acts) was found more frequently among the 
remainers than among the terminators (,’ 
= 11.49, p < .001). 

There was a higher incidence of marital 
disharmony (excluding divorce and separa- 
tion) among the remainers than among the 
terminators (,* = 5.96, p < .02). 

Compared to terminators, remainers con- 
tained more cases where, in addition to the 
child’s individual therapy, both parents were 
seen individually for treatment. Among the 
terminators only one parent (usually the 
mother) tended to be in concurrent treatment 
(x? = 15.25, p < .001). 

Termination was positively related to ob- 
taining clinic service immediately after ap- 
plication, without having to wait for the in- 
take interview; remainers tended to be fami- 
lies who had to wait for service (y? = 29.16, 
p < .001). 

In addition to the results which confirmed 
specific predictions, two findings for which no 
hypotheses had been formulated emerged as 
the data were analyzed. Not having been spe- 
cifically predicted, however, they are some- 
what less convincing than the results cited 
above. 

Compared to terminators, remainers have 
a higher incidence of specific somatic dis- * 


2 df=1 unless otherwise indicated. 
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orders (asthma, eczema, stuttering) as op- 
posed to nonspecific somatic disorders (un- 
diagnosed pains, sweating, tension) (,? 
= 10.19, p< .01). 

When truancy was compared with other 
educational problems (reading and other 
learning difficulties, school phobia, school be- 
havior problems) it was found more often 
among terminators than among remainers (x? 
= 11.04, p < .001). 


Unconfirmed Predictions 


The following relations, although not sta- 
tistically significant, were in the predicted di- 
rection: The older the child, the greater the 
tendency to remain (¢ = .25). Families previ- 
ously known to other social agencies tend to 
remain (x? = .63). Families known to juve- 
nile court tend to terminate (,? = 1.27). 
Children with runaway problems tend to be 
among the terminators (y? = 2.08). Antiso- 
cial behavior is high among the terminators 
(x? = .07). The more advanced the mother’s 
education, the greater the tendency to remain 
(x? = 2.68). The lower the child’s intelli- 
gence, the greater the tendency to terminate 
(x? = .15). The middle income group, as op- 
posed to the low income and high income 
groups, tends to remain (,? = 3.79). 

The following comparisons were found to 
be in the opposite direction of predictions, 
but none reached a statistically significant 
level. It had been predicted that Negro fami- 
lies would tend to terminate (yx? = .99); that 
fee paying cases would remain (,* = .005); 
that children with all types of somatic dis- 
orders would be among the remainers (,’ 
= .19); that children with all kinds of edu- 
cational problems would tend to remain (,? 
= .95); and that the more advanced the fa- 
ther’s education the greater the tendency to 
remain (x? = .04). 


DIscuUSSION 


Studies of abrupt termination in adult psy- 
chotherapy have shown that a patient’s mo- 
tivation is one factor which differentiates the 
terminator from the remainer (Affleck & Med- 
nick, 1959). In child guidance therapy it is 
the parents’ motivation which is the deter- 
mining factor, but the parents must not only 
have the motivation to bring the child and 


support his treatment experience, but they 
must also have the capacity to involve them- 
selves actively in the therapy program. Pre- 
mature termination tends to occur when 
either of these important conditions is not 
met. 

Length of time on the waiting list between 
application and intake seems to be one meas- 
ure of parental motivation. The parent with 
low motivation would not be expected to re- 
main in clinic contact if he had to wait for 
any length of time. When treatment is finally 
started the group still on the waiting list may 
be assumed to contain a high proportion of 
well motivated individuals. Thus, there is a 
significantly greater proportion of waiting list 
cases among the remainers than among the 
terminators. There was a similar relation, al- 
though below the adopted level of significance, 
when the groups are compared on waiting 
from intake to diagnostic study (,? = 1.37, . 
p> .20) and on waiting from diagnostic 
study to beginning of treatment (y? = 1.79, 
p> .10). While a waiting list is an unde- 
sirable feature of a clinic’s operation, delay 
does, in many instances, serve as a screening 
device which eliminates poorly motivated pa- 
tients before they get to the point of treat- 
ment. 

The comparison which bears most directly 
on the parents’ ability to involve themselves 
in treatment is that between families where 
only one parent was in concurrent treatment 
and those where both parents were being seen 
individually. When only one parent is in con- 
current treatment, it is almost always the 
mother. The results clearly show that when 
the father, as well as the mother, can be in- 
volved in the treatment plan the chances that 
the case will terminate prematurely are greatly 
reduced. 

It has become more universally recognized 
that the father plays an important and often 
crucial role in the treatment of children 
(Rubenstein & Levitt, 1957). The findings 
here reported lend strong support to this 
trend. They suggest that when the father is 
not in treatment, he may either actively 
sabotage treatment efforts or, by failing to 
support the mother and the child in their 
treatment experiences, materially undermine 
the process. This may be especially true in 


4 
i 
+ 
» 
» 
| 
| 


Terminators and Remainers in Treatment 


the treatment of boys, the subjects in the 
present study. 

Parental motivation might be expected to 
be a function of the distress which the child’s 
symptoms cause the parents. Hiler (1959), 
studying adults in individual psychotherapy, 
found that patients who complain only of 
somatic symptoms are likely to terminate, 
probably because the symptom enables them 
to “bind” their anxiety. In child guidance 
therapy, the reverse seems to be true. The 
bizarre behavior of the mentally ill child and 
such specific somatic disorders as asthma may 
be assumed to be distressing to the parent, 
who would be very much aware of the dis- 
turbance and motivated to obtain help. The 
results of relevant comparisons strongly sug- 
gest that the more apparent the symptom, 
the more likely the case is to continue in treat- 
ment. 

Another indirect reflection of parental in- 
volvement and motivation may be the fact 
that remaining was positively related to his- 
tories of developmental difficulties. A child 
with such a history may be assumed to have 
a psychological problem of long standing so 
that his parents might be more aware of it, 
more distressed by it, and thus more highly 
motivated to remain in treatment. In addi- 
tion, a mother’s report of difficulties in her 
child’s early years may represent her aware- 
ness of her own contribution to the problem, 
an awareness Lake and Levinger (1960) re- 
port as a factor in continuance. 

The confirmed prediction that remainers 
would contain proportionately more cases with 
marital disharmony also bears on a parent’s 
personal involvement. The finding reflects the 
parents’ ability to talk about a problem which 
concerns them. In addition, marital dishar- 
mony is a distressing personal problem for 
parents and one for which they might hope to 
find some relief through the clinic contact. 

On the basis of studies reported from adult 
outpatient clinics (Hollingshead & Redlich, 
1958; Rubenstein & Lorr, 1956), it had been 
predicted that certain sociocultural factors 
would be related to continuance in child guid- 
ance therapy. While there was a trend for 
the remainers to contain proportionately more 
Jewish than Catholic families (x? = 2.92, p 
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= <.10), no difference between remainers 
and terminators in fathers’ occupational class 
was found (,* = 3.72, df=5, p= <.70). 
Tuckman and Lavell (1959), who compared 
social status at all stages of clinic contact, 
also found that higher status patients were no 
more likely to maintain contact with the clinic 
than lower status patients. This would sug- 
gest that child guidance clinics are more suc- 
cessful than adult outpatient clinics in help- 
ing patients from the lower socioeconomic 
levels to remain in treatment. 

It has been reported that a higher propor- 
tion of the continuers perceive the child’s 
problem themselves, rather than through the 
demands made by the community, and that 
they desire change in themselves, as well as 
in their child and in their spouse (Lake & 
Levinger, 1960). The present results are con- 
sistent with these findings. Terminators were 
more often referred by an authority, such as 
juvenile court or school, while remainers were 
more often referred by friends, social agen- 
cies, or themselves (y? = 3.77, p < .10). 

The incidence of truancy is significantly 
higher among terminators than among re- 
mainers when compared with other school 
problems. This result had not been specifically 
predicted. It seems that truancy is an expres- 
sion of pathological family behavior patterns 
where avoidance of anxiety-arousing situations 
takes the form of physical departure. A family 
which abruptly and unilaterally terminates 
clinic contact and a child who truants from 
school may be manifesting the same basic re- 
action to stress. 

While we were able to confirm a significant 
number © a priori predictions, this investiga- 
tion shar. ; the weakness of all ex post facto 
research on closed cases. A specific fact only 
becomes a research datum if the patient re- 
ports it accurately, the interviewer records it 
fully, and the person responsible for coding 
codes it correctly. It is hoped that a projected 
long range study will not only overcome these 
difficulties but also address itself to the ques- 
tion of treatment outcome and to the prob- 
lem of parental attitudes toward the child 
which Lake and Levinger (1960), among 
others, have suspected to be an important 
variable in continuance of treatment contact. 
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SUMMARY 


Families who terminated child guidance 
contact before the fifth treatment session 
were compared with families who continued 
treatment for a minimum of 16 interviews. 
Remainers had significantly more develop- 
mental difficulties, unusual behavior, marital 
disharmony, and specific somatic disorders. 
They contained significantly more cases where 
both parents were in concurrent treatment. 
Terminators had significantly more school 
truancy, and they had less often experienced 
a waiting period between application and in- 
take. The results were discussed in terms of 
the importance of the parents’ motivation and 
their ability to involve themselves in the treat- 
ment process. 
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CHRONICITY OF NEUROPSYCHIATRIC HOSPITALIZATION: 
A PREDICTIVE SCALE 


JAMES M. ANKER 
Veterans Administration Hospital, Perry Point, Maryland 


This paper reports an attempt to predict 
the iength of time that newly admitted pa- 
tients will stay confined in a neuropsychiatric 
hospital. Specifically, the primary concern is 
with the prediction of the relatively infrequent 
event of chronicity. Although most mental pa- 
tients admitted are discharged as improved 
within a reasonable period of time, long-term 
neuropsychiatric chronicity has become a ma- 
jor problem in the field of mental health 
(Giedt & Schlosser, 1955; National Commit- 
tee against Mental Illness, Inc., 1957). It is 
important to note that the probability of dis- 
charge decreases markedly with the passage 
of time so that, over the years, hospitals are 
accumulating populations which are predomi- 
nantly chronic. “In the average siate mental 
hospital, about 15% of the patients have been 
there less than a year; about 25% have been 
there between 1 and 5 years; about 60% have 
been there from 5 to 45 years or longer” (Na- 
tional Committee against Mental Illness, Inc., 
1957). Despite the fact that durations of hos- 
pitalization have decreased somewhat in re- 
cent years (Harris & Norris, 1954; Kramer & 
Pollack, 1957) the problem of identification 
and special treatment of potential chronics 
early in the course of their hospitalization .re- 
mains a most important and intriguing chal- 
lenge. 

Predictive studies dealing with the course 
of the mental patient, e.g., regarding psycho- 
therapy, outcome of hospitalization, and like- 
lihood of rehospitalization, generally have pro- 
duced results of low predictive value (Bar- 
ron, 1953a, 1953b; Bayard & Pascal, 1954; 
Briggs, 1958; Cole, Swensen, & Pascal, 1954; 
Crandall, Zubin, Mettler, & Logan, 1954; 
Dunham & Meltzer, 1946; Feldman, Pascal, 
& Swensen, 1954; Gallagher, 1954; Gildea & 


Man, 1942-43; Orr, Anderson, Martin, & 
Philpot, 1954-55; Peterson, 1954a, 1954b; 
Schofield, Hathaway, Hastings, & Bell, 1954; 
Swensen & Pascal, 1954a, 1954b). Two stud- 
ies offer more promising results. Meeker’s 
(1958) exploratory MMPI scale to predict 
length of neuropsychiatric hospitalization ap- 
pears to have greater value but his limited 
samples somewhat obscure the results. One 
study using demographic data has been able 
to predict length of neuropsychiatric hospital 
stay (more than or less than 90 days) with 
77.2% overall accuracy (Lindemann, Fair- 
weather, Stone, Smith, & London, 1959). A 
discussion of common shortcomings and diffi- 
culties in prognostic research may be found 
in Zubin and Windle (1954). They also point 
out the unusual prevalence of contradictory 
and low-order resuits. 


PROBLEM 


Two basic problems presented themselves 
for study. Improving the means of identifying 
potentially chronic patients among those newly 
admitted appeared necessary. Such identifica- 
tion could result in new, or at least more in- 
tensive, treatment procedures for this group 
and, hopefully, shorten their stay. Secondly, 
it seemed advantageous to develop an identifi- 
cation procedure which would be meaningful 
in relation to personality characteristics of the 
patient. Rather than using demographic data, 
for instance, this information should be more 
capable of suggesting specific types of treat- 
ment procedures aimed at reducing chronicity. 
For these reasons a personality inventory was 
chosen to provide the basic data for the con- 
struction of a predictive scale. The MMPI 
was selected because data were available. This 
paper reports the development of the scale. 
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ORIGINAL SAMPLE 


Procedure. MMPI protocols at least 1 year old 
were taken from the records of the psychology serv- 
ice of a large Veterans Administration hospital. Those 
protocols that had been obtained over 2 months after 
the date of admission were not used as data. Further, 
because of their small number, the records of female 
patients were excluded. Two criterion groups were 
selected from these protocols; a short-term group 
which stayed 6 months or less and a long-term group 
which stayed 1 year or longer. The short-term group 
numbered 103 and the long-term group 63. The data 
from these dichotomous groups were item analyzed 
for every item in the MMPI item pool by chi square 
from a 2 X 2 contingency table. These computations 
were verified graphically by solving the chi square 
quadratic and generating an elliptical ABAC in the 
manner suggested by Andrews (1952). 


Results. Fifty-five different items differenti- 
ated the criterion groups at the .05 level or 
less, 33 items at the .02 level or less, and 17 
items at the .01 level or less. These frequencies 
exceeded those for a series of statistical tests 
at less than the .01 level (Block, 1960; 
Sakoda, Cohen, & Beall, 1954). The point 
biserial correlation between scale score (each 


TABLE 1 


CoMBINED SAMPLE CHARACTERISTICS 


Diagnosis 
Schizophrenic reactions 204 
Affective reactions 7 
Psychoneurotic reactions 72 
Paranoid state 2 
Psychophysiologic reactions 4 
Acute brain syndromes 8 
Chronic brain syndromes 17 
Personality trait disturbances 13 
Personality pattern disturbances 6 
Sociopathic personality disturbances 16 
Transient situational personality disorders 4 
Unknown 5 

358 

Race 
Caucasian 309 
Negro 41 
Oriental 2 
Unknown 6 

358 

Age 
Mean 33.8 
Standard deviation 91 


Note.—The total sample of 433 was reduced to 358 through 
the elimination of short form and otherwise incomplete protocols. 
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item arbitrarily given a possible score of 1) 
and the dichotomous criterion of chronicity 
was .53 for all 55 items. The point. biserial 
correlation between scale score on the items 
differentiating at the .02 level or less and the 
dichotomous criterion or chronicity was .63. 
These values are only approximations, how- 
ever, because of the discontinuity between the 
two samples. These original results were sub- 
mitted to cross-validation procedures. 


Cross-VALIDATION SAMPLE 


Procedure. MMPI protocols over 1 year old which 
were obtained within 2 months of admission were 
solicited from a number of Veterans Administration 
neuropsychiatric hospitals.1 Protocols were sorted 
into the same criterion groups as in the original sam- 
ple. The group with durations of more than 6 and 
less than 12 months was held for later analysis. These 
data were analyzed for those 55 items which dif- 
ferentiated the criterion groups in the original sam- 
ple. The short-term group in the cross-validation 
sample numbered 144 and the long-term group 123. 
The diagnostic, racial, and age characteristics of the 
original and cross-validation sample combined are 
presented in Table 1. 


Results. The cross-validating item analysis 
produced 21 items which differentiated be- 
tween the criterion groups at approximately 
the .05 level or less, 11 at the .02 level or less 
and 9 at the .01 level or less. These items 
and their combined probabilities (Lindquist, 
1940) are presented in Table 2.* 


ScALE CONSTRUCTION 


The 21 items thus selected comprised the 
basic scale. Data from the original and the 
cross-validation samples were pooled and items 
were weighted according to their discrimina- 
tory ability (Guilford, 1942). Only one item, 
number 35 in the MMPI booklet, warranted a 
weighted score of 2. The other 20 items in the 
scale were weighted 1. 

Protocols from the group that stayed be- 
tween 6 and 12 months were combined with 


1The author wishes to express his gratitude to 
William C. Hallow, Veterans Administration Hospital, 
Lebanon, Pennsylvania; Burke Smith, Veterans Ad- 
ministration Hospital, Roanoke, Virginia; and John 
B. Marks, Veterans Administration Hospital, Ameri- 
can Lake, Washington, for their cooperation in sup- 
plying the cross-validation data for this study. 

2 The list of the 55 items which discriminated the 
criterion groups in the original sample is available 
from the author on request 
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TABLE 2 


Cross-VALIDATED SCALE ITEMS AND THEIR POOLED PROBABILITIES 


Number 
in MMPI 
Booklet Item 
I am sure I get a raw deal from life. (T) 
My sex life is satisfactory. (F) 
If people had not had it in for me I would have been much more 
successful. (T) 
My family does not like the work I have chosen (or the work I 
intend to choose for my life work). (T) 
I am liked by most people who know me. (F) 
I do not read every editorial in the newspaper every day. (F) 
I resent having anyone take me in so cleverly that I have had to 
admit that it was one on me. (T) 
I commonly hear voices without knowing where they come from. 


No one cares much what happens to you. (T) 
It does not bother me that I am not better looking. (F) 


It is safer to trust nobody. (T) 


278 I have often felt that strangers were looking at me critically. (T) 

309 I seem to make friends about as quickly as others do. (F) 

324 I have never been in love with anyone. (T) 

354 I am afraid of using a knife or anything very sharp or pointed. (T) 


427 I am embarrassed by dirty stories. 


(T) 


482 While in trains, busses, etc., I often talk to strangers. (F) 


488 I pray several times, every week. 


(F) 


495 I usually “lay my cards on the table” with people that I am trying 


to correct or improve. (F) 


540 My face has never been paralyzed. 


(F) 


556 I am very careful about my manner of dress. (F) 


Note.—Letters in parentheses indicate direction items are scored to reflect longer durations of hospitalization. 
* This item is assigned a scoring weight of 2. All other items have unit weight. 


protocols from the item analysis criterion 
groups. Unfortunately, there were only six 
subjects in this “middle” group. The number 
of short form MMPIs in the combined group 
was insufficient to provide normative data for 
the short form and thus these protocols were 
omitted from further analysis. 

The total number of protocols in this com- 
bined sample, excluding short forms, was 386. 
Scale scores were determined for each subject 
in the combined sample. In scoring protocols 
on the chronicity scale the number of items 
in the scale that were not answered was re- 
corded. Based on the somewhat arbitrary 
judgment of the author, those protocols hav- 
ing three or more omissions were not included 
in the data analysis. On this basis 28 subjects 
were dropped leaving a total of 358, all with 
scores on the chronicity scale, who had vari- 
ous durations of stay in the hospital. The 
Pearson correlation between scale score and 
duration of stay in months was .41. 


The sample was successively dichotomized, 
according to duration of stay, at 3 months, 
6 months, 12 months, 18 months, and 23 
months. No dichotomy was made at 9 months 
because of the uneven sampling for durations 
in this range. The frequency distributions of 
chronicity scale scores for the various dura- 
tion dichotomies are presented in Figure 1. 
These distributions have been smoothed and 
adjusted for the base rate of the particular 
duration of stay (Cureton, 1957). These base 
rates were estimated by determining the dura- 
tion of stay for every male patient given an 
MMPI after admission to a large Veterans 
Administration hospital between October 1954 
and September 1956. Estimates were based 
on 240 cases. 

The optimal cutting point to include all 
cases, after following Cureton’s procedure of 
smoothing and adjusting the area under the 
curves for base rate, is the point of intersec- 
tion of the two curves. These points are indi- 
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Fic. 1. Chronicity scale score distributions for dichotomized durations of hospitalization. 


cated in Figure 1. Higher selectivity can be 
obtained, of course, by using two cutting 
points minimizing the area of overlap in the 
mid-range. This paper, because it is con- 
cerned with the total patient population, re- 
ports the findings for one cutting point and 
all of the sample for each dichotomy. 

Use of the scale to improve decision mak- 
ing was the crucial consideration. This was 
evaluated at each of the points of dichotomy 
by the procedures suggested by Meehl and 
Rosen (1955). The predictions or “decisions” 
uniformly were to identify the group staying 


longer rather than the shorter duration group. 
Table 3 presents these data and statements 
of the inequalities indicating whether or not 
use of the scale improves decision making. 
Improvement, or predicting better than one 
could with base rate information alone, occurs 
when the inequality is satisfied. Use of the 
scale aids prediction of the longer stay pa- 
tient in all of the dichotomies except that for 
2 years or longer. It might be mentioned that 
the scale improves prediction of the shorter- 
term patient in all cases. 

Additional cross-validation data were ob- 
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TABLE 3 


ACTUAL AND PrepicTED LENGTHS OF STAY IN MontHS AT SEVERAL CuTTING POINTS 


Actual duration 


< 3 Months 


123.7 


(70.5%) 


(10007) 


< 6 Months 


(100%) 


< 18 Months 


(100%) 


< 23 Months 


> 3 Months 


68.0 
(36.8°,) 


117.0 
(63.2%) 


185.0 
(100°) 


6 Months 


59.5 
(42.1%) 

81.9 
(57.9%) 


141.4 
(100%) 


> 12 Months 


105.7 

(78.1%) 
29.7 

(21.9%) 


135.4 
(100%) 


> 18 Months 


> 23 Months 


429.4 


69 < 83 


Note.—P = proportion of longer durations of stay in a specified clinical population. Q =1— P. Pi = proportion of valid 
positives. P: = proportion of false positives. The different Vs in each of the dichotomies is an artifact produced by proportioning 
the areas under the curves for base rates. 


° + 
4 
429 
< 191.7 a 
> 51.7 168.7 
(29.5%) 48 < 68 
_ 
77 507 
a (77.5%) 
49.2 131.1 
(22.5%) <.72 
< 12 Months 
(95.59%) 
13.8 43.5 
(4.5%) 
(100%) 
303.0 69.1 372.1 pe 
(95.5°%) (80.7%) 
14.2 16.5 30.7 
(4.5%) (19.3%) 79 <8 
317.2 85.6 402.8 
(100%) 
333.6 63.7 397.3 
(95.1%) (81.1%) 
17.3 14.8 32.1 
“a (4.9%) (18.9%) 82 + .79 
z 350.9 78.5 
(100%) (100%) 
th 
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tained from the Veterans Administration Hos- 
pital, St. Cloud, Minnesota, on 204 neuro- 
psychiatric patients having durations of hos- 
pitalization of 1 year or longer. No data 
for patients staying less than 1 year were 
available. The data obtained were evaluated 
against the earlier data for patients staying 
less than 1 year by the method described 
above. The cutting point determined by the 
earlier cross-validation data was used. Pre- 
diction of durations of greater than 1 year by 
the scale was better than could be expected 
by chance or base rate information alone. The 
P, 

inequality Q < P, + P,’ 
tent decision making is improved by use of 
the scale, was .69 < .75 for this additional 
data, as compared to .69 < .83 for the origi- 
nal data. Thus, while the improvement in de- 
cision making (prediction) is somewhat less 
than the original data showed, this independ- 
ent sample supports the conclusions drawn 
earlier. Further data on 165 neuropsychiatric 
admissions to the Veterans Administration 
Hospital, Minneapolis, Minnesota, were gath- 
ered. The median length of stay for these 
neuropsychiatric admissions in this general 
medical and surgical hospital was 46 days. 
The sample was dichotomized at the median, 
the frequency distributions of scale scores 
smoothed, and the intersect taken as the op- 
timal cutting point. Predictions based on this 
cutting point also were better than could be 


which reflects the ex- 


Predicted duration 
of hospitalization 


Meeker scale 
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TABLE 4 


expected by chance or base rate information. 


P, 
in this case was 


The inequality 0 < 
P, t Ps 


50 < .68. 


RELATIONSHIPS WITH SIMILAR SCALES 


As Zubin and Windle (1954) might predict, 
studies using MMPI items to predict length 
of hospital stay have produced remarkably 
little item overlap between scales purporting 
to do the same thing. Meeker (1958) devel- 
oped a 28 item scale by item analysis of the 
MMPI items against a criterion of chronicity. 
Although his samples were relatively small 
and from the same area it is unusual to find 
that only 2 items overlapped with the 55 
items discriminating in the original sample of 
this study. Further, both of these items were 
dropped from the present scale in cross-valida- 
tion. A scale being developed at the Veter- 
ans Administration Hospital, American Lake, 
Washington, has little overlap, if any, with 
the present scale or with the Meeker scale.* 
The lack of congruence in these independent 
studies, in addition to being somewhat star- 
tling, is provocative. Geographical differences 
might be suggested but it is not likely that 
this variable alone would produce such di- 
vergence. Whatever the reason, adequate sam- 

3 Personal communication from John B. Marks, 


Veterans Administration Hospital, American Lake, 
Washington. 


VALUES 
2 


Marks scale 


50 < .56 50 


> 46 Days® - 50 < .56 < .68 
> 3 Months 48 < .56 48 < 50 48 < .68 
> 6 Months® 60 58 < 61 < .72 
> 12 Months® .69 < .76 69 < .00 69 < 83 
> 18 Months> 19 .69 < 1.00" 719 < Bl 
> 23 Months> —A 82 < .889 82 .79 
> 12 Months® 69 < .75 
*Sample of 165 neuropsychiatric admissions to the Minneapolis Veterans Administration General Medical and Surgical 


Hospital. 


> Data from original pooled cross-validation sample, N = 259. 


* Inequality not interpretable because cell frequencies approach zero. 
4 No cutting point obtained because frequency distributions did not intersect. : 
© Data from 204 neuropsychiatric admissions to the Veterans Administration Hospital, St. Cloud, Minnesota, that stayed 1 


year or longer. 
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pling and cross-validation appear to be the 
immediate course of action. In the present 
study an attempt was made to obtain ade- 
quate sample sizes and cross-validation. Re- 
garding the sample, however, the number of 
subjects who had durations of hospitalization 
between 6 and 12 months is decidedly de- 
ficient. Combined sample size was adequate 
but undoubtedly could be improved. 

These other scales were evaluated as to 
their capacity to improve decision making in 
comparison with the present scale on the 
same data samples whenever the other scale 
scores could be obtained. Table 4 presents 


the Q < P, + Ps values for each of the scales 


on a number of predictions and samples. The ~ 


present scale uniformly improves decision 
making more than the other scales, does so 
more consistently, and for predictions of 
longer durations of hospitalization. 

One might suspect a relationship between 
a scale predicting response to psychotherapy 
and one predicting length of hospitalization. 
There is only one item, however, which over- 
laps between Barron’s Ego Strength scale and 
the chronicity scale. For this one item the di- 
rection of scoring which would contribute to 
a favorable prognosis for psychotherapy from 
Barron’s scale would, in the chronicity scale, 
contribute to the prediction of long stay. It 
seems obvious that the two scales are not 
measuring the same underlying variable(s). 


DIscussIoNn 


The results of this study have provided a 
21 item scale which improves prediction of 
the longer stay patient. It can discriminate 
meaningfully for durations of hospitalization 
up to 18 months. Although accuracy of pre- 
diction decreases as duration increases, it does 
appear that the scale may have practical 
value, particularly for isolating the potentially 
chronic group with a minimum of “false posi- 
tives.” Such predictions undoubtedly could be 
improved by combining probabilities with a 
separate predictor such as the demographic 
instrument developed by Lindemann et al. 
(1959). It is also possible prediction might 
have been more accurate had slightly less 
rigorous standards been used in the selection 
of items. Although the .05 level was used in 
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the item analysis, the pooled probabilities 
were considerably lower. Predictive efficiency 
may have been enhanced had the .05 level, 
for instance, been used for the pooled prob- 
abilities, thus generating a longer scale. 

The substance of this study is the scale that 
has been developed empirically. While it will 
serve to improve identification of the chronic 
patient, at this point it provides only inter- 
esting clues about self-reported personality 
characteristics underlying chronicity. The 
scale will be factor analyzed to define the 
“roots” of chronicity insofar as they are re- 
flected in these MMPI items. These results 
will be presented in a subsequent paper. It 
was the intent of this study to produce a pre- 
dictive scale which might be independent of 
diagnosis, at least to the extent that it would 
eventually permit isolation of personality fac- 
tors influencing chronicity. Although it is 
most probable that these factors themselves 
will not be entirely independent of diagnosis 
it appeared fruitful to investigate them sepa- 
rately. 

Although considerable care was taken in 
the construction and cross-validation of the 
present scale, in view of the current contra- 
dictory findings in this area it must be viewed 
as tentative and properly subject to further 
evaluation and modification. It should prove 
to be of some immediate interest to the cli- 
nician addressing himself to the study of 
neuropsychiatric chronicity if it is used with 
the proper reservations. Eventually it may in- 
crease our understanding of chronicity and 
consequently improve our ability to study 
and impede it. 


SUMMARY 


The MMPI item pool was item analyzed 
against a dichotomous criterion of neuropsy- 
chiatric hospital chronicity. The 55 items 
which were found to discriminate the criterion 
groups in the original sample were cross-vali- 
dated on the pooled data from three separate 
Veterans Administration neuropsychiatric hos- 
pitals. A 21 item scale was generated which’ 
was able to predict the “long-stay” patient 
at various dichotomies in duration of stay 
better than one could by chance or by base 
rate information. Although it may be of some 
immediate value, because of the prevalence 
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of contradictory findings in this area, the 
scale should be used with caution until fur- 
ther verification and modification. The scale 
will be factor analyzed to provide some no- 
tion of the underlying “roots” of chronicity, 
at least insofar as they can be tapped by self- 
report on a questionnaire. These results will 
be presented in a later publication. 
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A variety of judgment tasks have discrimi- 
nated those with severe mental or emotional 
disorders from less severely disturbed per- 
sonalities (Chambers, 1956, 1957; Cooper, 
1960). Judgment tasks applied to clinical 
populations have varied from psychophysical 
judgments of length of lines, density of dots, 
etc., to projective types of judgment (e.z., 
judging personality traits from photographs, 
classifying ambiguous stimuli such as ink- 
blots, etc.). Sarbin and Hardyck (1955) de- 
fined the latter type of judgment as “con- 
formance” and applied the concept to tasks 
where there is no criterion of judgment va- 
lidity other than agreement with a specified 
norm group. 

In the present study a conformance type of 
judgment task was given to college students. 
It was hoped that the task might be suffi- 
ciently sensitive to discriminate between well 
adjusted and poorly adjusted students, even 
though poorly adjusted college students would 
not generally compare in severity of pathol- 
ogy with the psychotic groups previously dis- 
criminated by judgment tasks. 


METHOD 


Picture Identification Test (PIT). At the begin- 
ning of a school year, all students at Georgia South- 
western Junior College were given the group form 
of the PIT (Chambers & Broussard, 1960). There 
were 256 male and 186 female students tested. For 
part of this test the subjects were given 10 cards on 
each of which were eight head and shoulder photo- 
graphs of individuals of the same sex as the subject. 
With each card the subject was given a list of 21 
brief behavior descriptions representing 21 needs of 
the Murray (1953) need system. The subject was 
instructed to select a picture to match each descrip- 
tion using any picture as often as he desired. The 
subject thus made 10 selections for each of the 21 
needs, making a total of 210 choices. 


Scoring. Norms for the popularity of judgment 
choices were based on the choices of 100 randomly 
selected students of each sex from the general stu- 
dent population. The score for any selection was de- 
termined by the number of those in the norm group 
making that selection. Twenty-one judgment sub- 
scores were obtained by summing the 10 choice 
scores for each need. A total judgment score was ob- 
tained by summing all 210 choice scores. The scores 
of the subjects of the norm groups were adjusted to 
eliminate their own contribution to the norm figures. 
The 21 raw judgment subscores and the total judg- 
ment score were converted to standardized scores de- 
rived from the total student population and based 
on an eight-point scale with the mean separating 
points four and five on the scale. 

Subjects. At the end of the school year in which 
the PIT had been administered, each member of the 
faculty was asked to select the 10 best adjusted stu- 
dents of each sex and the 10 most poorly adjusted 
students of each sex from those of his acquaintance. 
The faculty members were instructed to judge stu- 
dents on the basis of emotional stability and ma- 
turity rather than on intellectual ability. None of the 
faculty knew of the PIT results. Fourteen faculty 
members felt they had enough general contact with 
the student body to make selections. Those students 
selected by three or more of the faculty judges as 
best adjusted were chosen for a “well adjusted” group 
and those receiving three or more choices as most 
poorly adjusted were selected for a “poorly ad- 
justed” group. This procedure provided 15 boys and 
16 girls for the poorly adjusted group and 16 boys 
and 17 girls for the well adjusted group. 


RESULTS 


Judgment subscores of well adjusted and 
poorly adjusted boys were compared for dif- 
ferences by means of the ¢ test. The well ad- 
justed group showed a higher mean degree of 
judgment conformity on all 21 subscores and 
they differed significantly (at the .05 level or 
better) from the poorly adjusted group on 19 
of the 21 subscores. 

A similar analysis was applied to the re- 
sults of the well adjusted and poorly adjusted 
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TABLE 1 
JUDGMENT ScorES OF WELL AbjusTED AND 
Poorty ApjusTED STUDENTS 


Poorly 


Well Adjusted Adjusted 
Score Boys Girls Boys Girls 
1 4 4 1 
2 3 4 
3 2 1 1 2 
4 5 5 2 3 
4 4 
6 1 1 3 
7 1 3 1 
8 1 4 3 
® Dotted line tniflentes diese general student population. 


female groups. The well adjusted girls showed 
a higher mean degree of judgment conformity 
on all but two subscores (n Dominance and 
n Infavoidance) and they were significantly 
more conforming in judgment than the poorly 
adjusted group on 11 of the 21 subscores. 

Table 1 presents the total judgment scores 
for all four groups. A test of the difference be- 
tween the combined male and female well ad- 
justed groups (N = 33) and the combined 
male and female poorly adjusted groups (NV 
= 31) yielded a ¢ of 11.09, significant at the 
.001 level. A cutting point at the mean for 
the general population of students (X = 4.5) 
classified 71% of the poorly adjusted students 
below the mean and 85% of the well adjusted 
students above the mean. 

Further comparisons revealed that the well 
adjusted groups scored significantly higher 
than the poorly adjusted groups on the Prince- 
ton Scholastic Aptitude Test (¢ = 2.64, p 
< .02). This finding raised the possibility 
that the PIT judgment score might be cor- 
related with the SAT. A Pearson r of .28 was 
found between the PIT total judgment score 
and the SAT. Based on an N of 212, this cor- 
relation was significant at the .01 level but 
was not high enough to indicate use of the 
PIT judgment score as an index of scholastic 
aptitude or to suggest that prediction of gen- 
eral adjustment to a college environment 
based on the PIT judgment score would yield 
highly similar results to prediction based on 
the SAT. 


Jay L. Chambers 


DISCUSSION 


Evidence from the present study and from 
those previously cited points to impaired or 
deviant perceptual judgment among the emo- 
tionally or mentally disturbed. In an attempt 
to account for the relationship between ad- 
justment and judgment, Sarbin and Hardyck 
(1955) reasoned that a lack of perceptual 
conformance would increase the probability 
of a deviant behavioral response since be- 
havioral reactions are dependent on percep- 
tual reactions. Mental and emotional disor- 
ders are defined by deviant behavior; there- 
fore there is a direct connection between poor 
perceptual conformance and maladjustment, 
according to their theory. 

The above theory does not state what con- 
ditions underlie poor perceptual conformance. 
The conditions could, of course, be due to 
brain damage or be of genetic origin. 

There is a further possibility that judgment 
may be distorted by emotions and attitudes. 
Jurors and judges are dismissed from cases 
where they hold certain attitudes affecting 
the case. Perceptual judgments may be sub- 
verted to maintain what Morgan (1956) has 
termed the self-preservation of attitudes and 
beliefs. According to this theory, there is a 
selective process whereby evidence favorable 
to existing attitudes is registered while nega- 
tive evidence is excluded. It is obvious that 
one who persistently modifies perception 
rather than belief, when the two are incom- 
patible, will eventually find himself at odds 
with others and with reality. It would seem 
better, from a mental health viewpoint, that 
perception should be uncensored and that 
attitudes should be influenced by perceptual 
evidence. 

Instances of poor perceptual conformance 
may provide indicators of specific psychologi- 
cal conflicts. An inspection of the data of the 
present study showed that, for a given indi- 
vidual, perceptual conformance varied con- 
siderably from one need to the next. Future 
research might profitably attempt to deter- 
mine whether specific problem or conflict 
areas can be diagnosed by differences in per- 
ceptual conformance scores of the various 
needs as measured by the PIT. 
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Judgment of Photographs and Adjustment 


SUMMARY 


The group form of the Picture Identifica- 
tion Test was administered to all students at 
a small state junior college at the beginning 
of a school year. Each subject received 21 
judgment subscores and a total judgment 
score based on his matchings of photographs 
of people with Murray need descriptions. At 
the end of the school year, 33 students were 
selected by faculty members to comprise an 
emotionally well adjusted student group and 
31 students were chosen as showing poor 
emotional adjustment. 

The boys chosen as well adjusted made sig- 
nificantly higher scores (more popular match- 
ings) than the poorly adjusted boys on 19 of 
the 21 judgment subscores. The well adjusted 
girls made significantly higher scores than did 
the poorly adjusted girls on 11 of the 21 
judgment subscores. 

On the total judgment score, the difference 
between the combined well adjusted and com- 
bined poorly adjusted groups was highly reli- 
able (¢ = 11.09, p < .001). Using the general 
population mean as a cutting point, 78% of 
the subjects could be classified by the total 
judgment score of the PIT in agreement with 


faculty selections of well adjusted and poorly 
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adjusted students. A low but significant r of 
28 (p< 01; N = 212) was found between 
the PIT total judgment score and the Prince- 
ton Scholastic Aptitude Test. 

The results were discussed in relation to 
Sarbin’s theory of perceptual conformance and 
Morgan’s theory of perceptual selectivity as 
a means of insuring the self-preservation of 
existing attitudes and beliefs. 
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The use of Rorschach summary scores in 
differential diagnosis, though widespread in 
clinical practice, has seldom been justified by 
the research literature. Attempts to cross- 
validate checklists such as the Miale and 
Harrower-Erickson (1940) signs of neurosis 
and the Davidson Rorschach adjustment scale 
(Davidson, 1950) have typically been unsuc- 
cessful (Berkowitz & Levine, 1953; Corsini & 
Uehling, 1954). Rieman (1953) combed the 
literature for Rorschach signs supposedly dis- 
criminatory between neurotics and schizo- 
phrenics. Of 86 such “indicators” he evalu- 
ated, only 5, or barely a chance expectation, 
discriminated significantly between the neu- 
rotic and schizophrenic subjects he studied. 

Results such as these have dampened re- 
search interest in Rorschach summary scores 
as diagnostic indicators, if the paucity of re- 
cent literature in this area may be taken as a 
criterion. However, related work has pointed 
up potential sources of error in the determi- 
nation of diagnostic signs which often have 
not been taken into account. From papers 
by Cronbach (1949), Fiske and Baughman 
(1953), and Knopf (1956) several guidelines 
for the empirical derivation of Rorschach 
signs may be abstracted: (a) the number of 
independent statistical tests applied should be 
kept at a minimum; (4) nonparametric meth- 
ods of statistical inference should be used; 
(c) the response total should be controlled; 
and (d) Rorschach scores and dependent vari- 
ables must be kept independent. The present 
study consists of the empirical selection of 
three Rorschach signs associated with severity 
of psychopathology in an exploratory study 
and two subsequent attempts to cross-validate 
these summary scores as indicators of schizo- 
phrenia. 


THREE RORSCHACH SCORES INDICATIVE OF 
SCHIZOPHRENIA 


IRVING B. WEINER 
University of Rochester School of Medicine and Dentistry 
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EXPLORATORY STUDY 


The exploratory sample consisted of all 
adult patients in a 6-month period from the 
psychiatric services of a general hospital for 
whom scorable Rorschach protocols were 
available, with the exception of those pa- 
tients whose primary diagnosis was organic 
rather than functional in nature. The sub- 
jects’ hospital and clinic records were uti- 
lized to assign them to one of three diagnostic 
categories: neurosis, which included cases of 
conversion hysteria, obsessive-compulsive neu- 
rosis, anxiety reaction, and neurotic depressive 
reaction; character disorder, which applied to 
instances of personality trait and personal- 
ity pattern disturbances; and psychosis, which 
included schizophrenic, psychotic depressive, 
and involutional psychotic reactions. The 
diagnostic criterion was the label assigned to 
the patient during the period of psychiatric 
evaluation in which the Rorschach had been 
given. For hospital patients, these labels were 
the discharge diagnoses; for clinic patients, 
the recorded consensus of an intake committee 
was used. 

The Rorschach protocols for this sample, 
which numbered 71, had previously been 
scored for the presence of a number of signs 
under investigation in a related context. Three 
of the signs unexpectedly appeared to differ- 
entiate among the diagnostic categories. To 
the extent that the labels neurosis, character 
disorder, and psychosis represent a continuum 
of increasing disturbance, severity of illness 
was significantly associated with tendencies to 
(a) give 1 or 2 CF, (b) have a Sum C be- 
tween 1.5 and 3.0, and (c) give at least 1 CF 
or C response with no C” responses. Table 1 
indicates the number of subjects in each diag- 
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Rorschach Scores Indicative of Schizophrenia 


TABLE 1 


FREQUENCIES OF EXPLORATORY SUBJECTS OF DIFFER- 
ENT DIAGNOSES WITH A GIVEN NUMBER OF SCHIZ0- 
PHRENIC INDICATORS 


Number of Indicators* 
Diagnostic 
Category 
Neurosis 
Character disorder 
Psychosis 


2 = 13.581 with 6 df; p 


nostic category who received from none to all 
three of these signs. Subsequent analysis re- 
vealed no significant differences between the 
three diagnostic groups in mean age or in 
mean or variance of response total. Further- 
more, each group was composed of approxi- 
mately two-thirds females and one-third 
males, thus mitigating any influence of sex 
difference. 


Cross-VALIDATION STUDIES 


Procedure 


Two separate samples were utilized to evaluate the 
concurrent validity of a checklist comprised of the 
above three summary scores. These samples, A and 
B, were again selected from case files and consisted, 
respectively, of 52 patients from the 6 months fol- 
lowing and 89 patients from the 12 months preceding 
the exploratory period. Since the psychotic group in 
the exploratory study had contained few nonschizo- 
phrenic patients, only schizophrenics were included 
in the psychotic group for the cross-validating sam- 
ples. Sample A contained 16 neurotics, 18 character 
disorders, and 18 schizophrenics; for Sample B the 
totals were 27, 31, and 31. The total population 
ranged in age from 15 to 62 with a mean of 29.62 
years and had given a mean number of 23.23 re- 
sponses to the Rorschach. Within each sample there 
were no significant differences among the three diag- 
nostic categories in mean and range of age or in 
mean and variance of response total. Each group had 
approximately 50% males, the six groups ranging 
from 44 to 61% males. 

It was impossible to determine the extent to which 
the Rorschach findings had influenced the diagnostic 
label attached to the patients, thus contaminating the 
data. However, it was the investigator’s hypothesis 
that even in cases where the decision to designate a 
patient schizophrenic had depended entirely on the 
Rorschach findings, it was unlikely that the three 
signs under consideration here—1 or 2 CF, Sum C 
1.5 to 3.0, and C or CF without C’—had contributed 
prominently to the psychologist’s opinion that the 


patient’s Rorschach was consistent with the presence 
of schizophrenia. To test this ancillary hypothesis, 
five staff psychologists, who had in fact done or su- 
pervised the testing of more than three-fourths of 
the subjects in the study, were asked to select from 
the following list of 10 Rorschach variables 6 which 
they felt were of either primary or secondary im- 
portance in the assessment of schizophrenia: (a) 
number and/or % Dd; (b) Sum C; (c) number 
confabulated and contaminated Rs; (d) number CF; 
(e) number and/or % P; (f) F+%; (g) num- 
ber and/or quality M; (hk) number C, Csymb, and 
Cnam; (i) amount bizarre content; and (j) number 


Results 


Table 2 indicates the number of subjects 
with different diagnoses in Samples A and B 
who received a given number of the signs. 
The degree of association between diagnostic 
category and number of signs, as estimated 
by x’, is significant beyond the .001 level of 
confidence for both samples. Further investi- 
gation of Table 2 reveals that the major dif- 
ference exists between the schizophrenic group 
on one hand and the neurotics and character 
disorders on the other. It may be seen that in 
Sample A, 87% of the neurotics and 78% of 
the character disorders received one sign or 
less, while only 22% of the schizophrenics 
had less than two signs; for Sample B the 
corresponding percentages are 81, 68, and 22. 
Chi square values computed for these grouped 
data were significant beyond the .001 level for 
both samples. Comparisons of the relative fre- 
quency with which the neurotics and charac- 
ter disorders received one or less or two or 


TABLE 2 
FREQUENCIES OF SUBJECTS OF DIFFERENT DIAGNOSES 
tN SAMPLES A AND B witH A GIVEN NUMBER OF 
SCHIZOPHRENIC INDICATORS 


Number of Indicators** 
Sample A Sample B 
Diagnostic 
Category oe 1 2 


Neurosis 0 
Character 

disorder 
Schizophrenia 


** For Sample A x? = 28.881 with 6 df; 


<.001. For 
Sample B x? = 30.072 with 6 df; p < .001. 
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more signs yielded y* values smaller than 1.00 
in each sample. 

The data were analyzed further to deter- 
mine if the three signs were contributing dif- 
ferentially to the above results. Six 2 x 3 
contingency tables were used to assess the 
association between the three diagnostic cate- 
gories and the presence or absence of each 
sign for Samples A and B. The obtained y? 
values were all significant beyond the .01 
level. 

The ratings by the psychologists of which 
Rorschach variables are prominent in their 
evaluation of possible schizophrenia also 
yielded a clear-cut result. Six of the elements 
were endorsed as being of primary or sec- 
ondary importance by at least four of the 
five raters, and a seventh was chosen by three 
raters. The remaining three variables were not 
selected by any of the psychologists as being 
of either primary or secondary importance in 
the diagnosis of schizophrenia. These three 
were Sum C, number CF, and number C’, the 
variables under investigation in this study. 

The fact that positive results were thus ob- 
tained for indicators not endorsed by the 
testers might suggest either that the test re- 
ports carried little weight in the final psychi- 
atric diagnosis or that the psychologists’ diag- 
nostic conclusions were little influenced by 
their expressed views on indicators of schizo- 
phrenia. To deal with this question, an addi- 
tional analysis was undertaken with Sample A. 
It was found that the psychologists’ test re- 
ports and the finally established diagnoses 
concurred in the presence or absence of schizo- 
phrenia in 90% of the cases, which casts 
doubt on the first of the above alternatives. 
Investigation of those of the endorsed signs 
which can be objectively scored revealed that 
the schizophrenic group had a significantly 
(p < .05) lower F + % and tended (p < .10) 
more frequently to have given 1 or more pure 
C response and/or 1 or more M— response 
and/or less than 1 M response than the non- 
schizophrenic groups. However, only five of 
the schizophrenics in Sample A had F + %s 
below 70, only five failed to give M, and only 
four gave any M— or C; only four had as 
many as two of these signs. Therefore, these 
scores, though differentiating between groups, 
were not present with sufficient frequency 


to have much facilitated the psychologist’s 
evaluation in the individual case. Further- 
more, no relationship could be discovered be- 
tween diagnostic category and P, P%, or 
Dd%. Consequently, it would seem in many 
cases that the psychologists, sometimes of 
necessity and sometimes of choice, had based 
their diagnostic impressions on indicators 
other than those they endorsed in the check- 
list. 


DISCUSSION 


The data clearly indicate that likelihood of 
being diagnosed schizophrenic in the psychi- 
atric setting studied is associated with tend- 
encies to give 1 or 2 CF, a Sum C from 1.5 
to 3.0, and C or CF without C’ on the Ror- 
schach. The fact that these three schizophrenic 
indicators were suggested by an exploratory 
study and cross-validated at highly significant 
levels of confidence in two subsequent studies 
makes it unlikely that they merely reflect 
chance fluctuations. The built-in controls for 
age, response total, and sex eliminate several 
frequent sources of error. The author’s em- 
phasis on nonexistent contaminating variables 
is felt necessary because the results cannot 
readily be explained in terms of prevalent 
Rorschach theory. There is no literature which 
suggests that these three signs are associated 
with schizophrenia, and the rating sheet used 
in this study demonstrates that psychologists, 
at least those in the setting where this re- 
search was done, do not consciously utilize 
these signs in evaluating schizophrenic poten- 
tial. That positive results were achieved seems, 
in view of the congruence between the ex- 
aminers’ judgments and the diagnostic labels 
of the subjects, attributable to the finding that 
some of the traditional signs of schizophrenia 
they endorsed were often ignored by them 
and others occurred with insufficient fre- 
quency to figure prominently in their indi- 
vidual diagnostic conclusions. The nature of 
the design does not allow further inference 
concerning the validity of these traditional 
signs, however. 

A tentative rationale for the findings may 
be offered. The three signs taken together rep- 
resent some minimal use of chromatic color 
without use of achromatic color. Individuals 
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Rorschach Scores Indicative of Schizophrenia 


who avoid color almost entirely or who use 
both chromatic and achromatic color freely 
would not receive the signs. It may follow 
that, within a patient population, those per- 
sons who use color freely are displaying the 
emotional lability frequently associated with 
repressive defense, while those who avoid 
color are manifesting the isolation of affect 
which accompanies intellectual defenses. The 
remaining patients, having neither pattern, 
might be viewed as lacking a stable defensive 
structure and being prone to schizophrenic 
reactions under stress. Other explanations are 
certainly possible, but confirmation of any 
hypotheses suggested by the data would re- 
quire additional research. 

It is the author’s feeling that even with 
present knowledge the data have value for the 
clinician. It should be noted that the signs 
were not derived and cross-validated by com- 
paring blatant schizophrenics with normals. 
Psychological test data gathered in such stud- 
ies are often useful for making discriminations 
only in situations where the discrimination is 
so obvious that psychological tests are super- 
fluous. The subjects in this study were se- 
lected from actual case files and typically had 


been referred because the diagnosis was not: 


clear. Hence they constitute the type of diag- 
nostic problem with which the clinician must 
deal in his daily practice, and it is within such 
a population that the three signs were so ex- 
tremely in accord with the diagnoses finally 
established. The signs by no means identified 
all the schizophrenic patients. However, the 
data seem to justify their inclusion among 
other Rorschach variables commonly con- 
strued as relating to the presence of a schizo- 
phrenic disorder. 


SUMMARY 


Three Rorschach signs—1 or 2 CF, Sum C 
between 1.5 and 3.0, and C or CF without 
C’—were found significantly associated with 
severity of psychopathology in an exploratory 
study. In two cross-validating studies, with 52 
and 89 subjects, each of these signs was re- 
ceived significantly more frequently by schizo- 
phrenic patients than by neurotics and char- 
acter disorders. The design contained controls 
for age and sex and for mean and variance of 
Rorschach response total. A tentative rationale 
for the results is offered, and it is felt that 
the data recommend these signs for inclusion 
among Rorschach criteria for the presence of 
schizophrenia. 
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Canter (1951) reported a descriptive study 
of MMPI profiles of patients with multiple 
sclerosis (MS). Although recognizing the limi- 
tations of the approach, he calculated the av- 
erage MMPI profile for this group of 33 
World War II veteran patients and inferred 
from the mean profile for the group that the 
typical personality configuration in MS in- 
cluded a reaction to the stress of the illness 
with depression and its accessory symptoms. 

When depression is a major variable under 
study, the averaging of MMPI profiles can 
obscure important profile differences espe- 
cially if subsamples are combined, one of 
which has very high D scores and the other 
of which has very low D scores. This is a par- 
ticularly important consideration in the study 
of an illness such as MS in which both reac- 
tions of high depression and low depression 
because of denial and repression have been ob- 
served. If discrete profile types were to be 
produced reflecting each of these reaction 
types, such an important difference would be 
cancelled out and masked by averaging. 

The aim of the present study was to at- 
tempt to check Canter’s MMPI findings and 
to answer the following questions: (a) Can 
a typical response to MS in the direction of 
depression be inferred from the MMPI? (4) 
Among patients with neurological lesions, are 
MMPI profiles indicative of depression more 
common in MS than in other conditions? To 
investigate the latter question, a control group 
of neurological patients was selected who had 
suffered brain injury from external causes. 

A second purpose of this paper is to report 
on significant relationships between MMPI 
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profile characteristics and illness and demo- 
graphic variables which became apparent when 
profiles were studied as depressed and non- 
depressed types rather than as average group 
profiles. 


METHOD 
Samples 


The MS sample consisted of 25 male veterans hos- 
pitalized on the Neurology Service of the Minneapolis 
Veterans Administration Hospital who had received 
a medical diagnosis of MS. The control group con- 
sisted of 25 male veterans from Neurology with a 
medical diagnosis of traumatic brain injury. Except 
for three cases in the MS group, the samples repre- 
sented all of the cases with these diagnoses who had 
been administered the MMPI and Wechsler-Bellevue 
on the Neurology Service during the years 1949 to 
1952. Three MS cases currently on the Neurology 
Service were tested and included in the analysis to 
increase the size of the sample. 

Age, IQ, and duration of illness data are reported 
in Table 1. The group differences in mean age and 
mean IQ were not statistically significant. The mean 
duration of illness of the Minneapolis MS sample 
was 58 months compared to 30 months for the 
control group; this difference is reliable (p < .001). 
Canter’s MS group would appear to be comparable 
to the present MS group in age and duration of ill- 
ness although there may be differences between the 
groups in socioeconomic status. Also, Canter studied 
outpatient veterans while the present sample of vet- 
erans was hospitalized. 

Table 1 also reports the MMPI average T scores 
of the three samples. All three of the mean profiles 
were highly similar in shape; Canter’s mean profile 
shows higher elevations on Hs, D, and Hy than the 
other two profiles. In addition, the Minneapolis MS 
sample is significantly higher on D and Sc than the 
Minneapolis control group. The magnitudes of the 
latter differences are small. 


Procedure 


The Minneapolis MS and control samples were sub 
divided on the basis of MMPI scores. The MMPI 
profile groupings were based on the following con- 
siderations. First, since there is a relatively high in- 
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MMPI in Multiple Sclerosis 


TABLE 1 


MEAN AGeE, IQ, Duration or ILLNESS, AND MMPI T Scores ror MINNEAPOLIS 
MULTIPLE SCLEROSIS AND CONTROL SAMPLES AND CANTER MULTIPLE SCLEROSIS SAMPLE 


Duration 
(months) 


Minneapolis 
MS sample 
(N = 25) 


Minneapolis 

control 

sample 30.6 
(N = 25)) 


102.9 


Canter MS 
sample 
NV = 33) 


cidence of one scale being elevated over T score 70 
even in normal samples (Hathaway & Meehl, 1951) 
it was decided to classify the profiles as abnormal if 
two or more scales were elevated beyond the normal 
limits. Second, since the investigation focused on de- 
pression, it seemed rational to cut the samples on the 
basis of elevation on the D scale. Third, inspection 
of the set of profiles suggested that the abnormal- 
normal dichotomy and the depressed-nondepressed 
categories would describe much of the relevant in- 
formation obtained in a majority of the profiles. 

The following categories and rules were adopted: 

1. Normal Depressed: Not more than one scale 
over 70. 

D> 60 


2. Normal Nondepressed: Not more than one scale 
over 70. 
D<60 
3. Abnormal Depressed: Two or more scales over 
70. 
D> 80 
4. Abnormal Nondepressed: Two or more scales 
over 70. 


D<80 


RESULTS 


Table 2 shows the resulting distribution of 
the profile types in the Minneapolis MS and 


TABLE 2 


FREQUENCY OF PROFILE TyPEs IN MULTIPLE SCLEROSIS 
AND CONTROL GROUPS 


Multiple 


Profile Type Sclerosis Control 


Normal Depressed 
Normal Nondepressed 
Abnormal Depressed 
Abnormal! Nondepressed 


Hy 


68.0 


60.1 


control samples. The chi square test revealed 
the distributions of the two samples to be re- 
liably different (,* = 7.76, p < .05). 

Table 3 shows the mean age, IQ, duration 
of illness, and MMPI T scores for each of 
the four profile types in the MS and control 
groups. Within the four MS profile types, sig- 
nificant F ratios were obtained for age (p 
< .05), IQ (p< .01), and duration of ill- 
ness (p < .01). From the Sheffé test (Sheffé, 
1953), which was applied to make all pos- 
sible comparisons of means, it appears that 
(a) the Normal Nondepressed type was sig- 
nificantly younger than the Abnormal De- 
pressed type, (6) the Normal Nondepressed 
type had a significantly higher mean IQ than 
the Abnormal Depressed type, (c) the Nor- 
mal Nondepressed type had a significantly 
shorter duration of illness than any of the 
three other types, and (d) the Abnormal Non- 
depressed type also had a significantly higher 
mean IQ than the Abnormal Depressed type. 

No significant F ratios were found in the 
analysis of the control group. 

There appeared to be an interaction be- 
tween the age, IQ, and duration of illness 
variables within the profile types in the MS 
sample. Compared to the mean score for the 
total Minneapolis MS sample, every case of 
the Normal Nondepressed profile type had 
lower than average age and higher than av- 
erage IQ and shorter than average duration 
of illness except for one case where age was 
at the mean. Seven of the eight cases of the 
Abnormal Depressed type had IQ scores be- 
low the mean of the total MS group. 
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MMPI in Multiple Sclerosis 


DIscUSSION 


From the present findings, it would seem 
safe to state that there is a reaction to MS in 
the direction of severe depression more fre- 
quently than in a neurological disorder such 
as traumatic brain injury. Thirty-two percent 
of the MS sample obtained MMPI profiles of 
the Abnormal Depressed type compared to 
4% of the control sample. The longer dura- 
tion of illness in the MS group could be a fac- 
tor contributing to the greater incidence of 
such profiles in the MS sample. Vieg (1947) 
reported that the most important mechanism 
used by patients in adjusting to MS was re- 
pression, but that as the disease progressed 
this kind of equilibrium could not be main- 
tained and the patients gave up their front of 
being emotionally well adjusted and happy. It 
would not seem warranted to call dysphoria 
the typical reaction to MS since only a third 
of the sample fell into the Abnormal De- 
pressed category. Rather, it would appear that 
there are several kinds of reaction to the dis- 
ease. Philippopoulos, Wittkower, and Cousi- 
neau (1958) found a broad range of person- 
ality structures in a group of 40 MS patients 
studied through interview and psychological 
tests. 

Most studies of MS (Grinker, Hamm, & 
Robbins, 1948; Langworthy, 1948; Philip- 
popoulos et al., 1958; Sugar & Nadell, 1948; 
Vieg, 1947) seem to show a strong thread of 
consistency in reporting that repression, in- 
ability to express hostile feelings directly, 
overconformity, and/or hysteroid traits in 
general are common in patients with MS 
either after, or both before and after the out- 
ward manifestations of the MS symptoms. It 
should be noted in passing that similar per- 
sonality traits have been reported in illnesses 
which are generally considered to be psycho- 
somatic in nature such as the recent investi- 
gation of ulcer patients by Marshall (1960). 
Some support for the observation that repres- 
sion and denial are preferred mechanisms of 
at least some MS patients could be found in 
the present MMPI data but the same evidence 
could be found for the control sample. It 
would be impossible to distinguish cause from 
effect in any case. 


On the one hand, it would seem that it is 
unwarranted to make sweeping generalizations 
about typical emotional reactions to MS, but 
on the other hand, it is not necessary to adopt 
a narrow idiographic point of view. There 
is evidence that multivariate classification on 
demographic variables such as age, intelli- 
gence, duration of illness, and perhaps sex, is 
needed to establish most effectively the rela- 
tionships between illness characteristics and 
personality variables. In the present study, it 
would appear that severe depression is more 
likely to occur in less intelligent male MS pa- 
tients and that younger, more intelligent male 
patients with shorter duration of illness are 
more likely to have a repertory of responses 
which enable them to avoid depression. These 
relationships are not evident in the control 
sample. The simplest explanation for the ab- 
sence of such relationships in the control sam- 
ple would be in terms of the disabling and 
progressive nature of MS which might evoke 
responses which would not be called forth by 
a brain injury that was more specific in its 
consequences and not progressive. However, 
this explanation would not seem to be com- 
pletely satisfactory and the possibility that at 
least some of the MS patients would not over- 
lap the control patients on significant person- 
ality characteristics cannot be ruled out. 

There is evidence in the present data of the 
need for longitudinal analysis in the study of 
MS. Data concerning preillness educational 
and occupational attainment were incomplete 
but suggested the need for further study of 
the possibility that the Abnormal Depressed 
MS cases had lower IQs and the Normal Non- 
depressed MS cases had higher IQs which 
antedated the onset of diagnosable MS symp- 
toms. The changes in patients over time after 
the development of symptoms is illustrated 
by the case of a 31-year-old college graduate 
with an IQ of 129 who received a diagnosis 
of MS shortly after the development of symp- 
toms of the disease. At that time he obtained 
an MMPI profile with a peak score on Hy at 
65 which would place him in the Normal Non- 
depressed profile category. Eight years later 
he obtained an IQ of 104 and an MMPI pro- 
file of the Abnormal Depressed type with a 
peak on D of 109. 
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In general, it might be concluded that it is 
dangerous to use any single measure of psy- 
chological status, presuming that such status 
is sensitive to a condition of illness, when it 
is either established or likely that the psycho- 
logical variable is correlated with demographic 
and other nonillness variables which have not 
been controlled. 


SUMMARY 


MMPI and Wechsler-Bellevue performance 
of 25 patients with MS and 25 control pa- 
tients with traumatic brain injuries indicated: 

1. There was a reaction of severe depres- 
sion in 34% of the MS group compared to 
4% of the control group as inferred from 
MMPI profile types. This supported Canter’s 
previous findings in part but did not appear 
to warrant the generalization that the typical 
response to MS is in the direction of depres- 
sion. 

2. MS patients with profile types indicating 
severe depression had lower IQs than average 
for the group of MS patients. 

3. Younger, more intelligent. MS patients 
with shorter duration of illness than average 
for the group tended to obtain nondepressed 
profiles. 

4. The MS group differed from the control 
group in that no relationship of profile type 
with IQ, duration of illness, or age was found 
in the control group. This was considered 
most likely to be due to the differing nature 
of the diseases but alternative explanations 
could not be ruled out. 

5. The importance of a longitudinal ap- 
proach to MS was indicated. 


6. The need was pointed out for control of 
all possible nonillness variables in order to be 
able to conclude that any single psychological 
characteristic is sensitive to a condition of 
illness. 
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DICHOTOMOUS EVALUATIONS IN SUICIDAL INDIVIDUALS 


CHARLES NEURINGER 


Suicide Prevention Center, Los Angeles, California 


It need not be pointed out that suicide is a 
leading cause of death in the United States 
and that it constitutes a serious mental health 
problem. Research is slow and difficult be- 
cause of a number of complex methodological 
difficulties, among which is the problem of an 
adequate definition of the suicidal rubric. So 
many different kinds of behaviors and experi- 
ences come under this heading that it is diffi- 
cult to perceive a common core of psychologi- 
cal characteristics that can be denoted as “sui- 
cidal” and easily discriminated from other 
pathological states. 

Edwin S. Shneidman (1960) has proposed 
such a core of characteristics of thinking in 
suicidal individuals. He posited that the neu- 
rotic-suicidal individual is the perpetrator of 
thought distortions, the presence of which 
leads to a high probability of suicide occur- 
ring. One of these proposed types of thought 
distortions was the tendency to think in terms 
of absolute value dichotomies. 

By Dichotomous Evaluative Thinking is 
meant the polarization of thought into an ex- 
treme double bind value system (e.g., “good 
vs. bad, right vs. wrong, beautiful vs. ugly”). 
The polarization is considered to be extreme 
in that the object of thought is considered as 
“all or mostly good” or “all or mostly bad,” 
with very little modulation of the object into 
finer discriminations, commonly called “shades 
of grey.” 

Shneidman feels that rigid adherence to ex- 
treme Dichotomous Evaluative Thinking can 
lead to situations that are lethal (e.g., if an 
individual is dissatisfied with his life and does 
not find it wholly acceptable to him, he does 
not think of alternate ways of changing his 
life but of death as the only alternative). 

Dichotomous thinking seems to be an 
“either-or” kind of value thinking and not 
an “and” kind of thinking. Shneidman con- 


cluded that the extreme dichotomous thinker 
is trapped in a double bind and must always 
embrace one of the extremes. The implication 
is that if one adheres to a strict dichotomy of 
thought, then one has few or no degrees of 
freedom with which to maneuver. Alternatives 
cannot be perceived and the situation becomes 
unresolvable, thus leading to ideas of escape 
through death. 

Should Dichotomous Evaluative Thinking 
be a distinguishing characteristic of suicidal 
thinking, this would lend support to the view 
that there are common characteristics that are 
organized into a core personality, which could 
be called the “suicidal personality.” 


PROBLEM 


The general question that the following re- 
search attempted to answer was whether sui- 
cidal individuals can be differentiated from 
(a) another emotionally disturbed group and 
(6) from a normal control group, as far as 
the presence and extent of extreme Dichoto- 
mous Evaluative Thinking was concerned. 


METHOD 


The Semantic Differential method, which was de- 
veloped by Charles Osgood (1957), was selected as 
a means of studying the hypothesis that Dichotomous 
Evaluative Thinking is a characteristic of suicidal 
thought. 

Dichotomous thinking of an extreme kind should 
reflect itself in extreme evaluations of the concept 
being tested. If suicidal subjects evaluate more di- 
chotomously than the control subjects, they should 
score a concept as being “more good, more beautiful, 
more unpleasant and more sad” than the control 
group evaluations of the same concept. 

Another reflection of dichotomous thinking would 
be found in the amount of difference in values be- 
tween two concepts that are semantically opposite 
such as love and hate or God and the devil. Greater 
Dichotomous Evaluative Thinking of an extreme 
kind in the suicidal individuals should reflect itself 
in the perception of greater value differences on the 
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same pair of opposing concepts when compared with 
the control groups. 

Consistent with Osgood’s suggestion concerning the 
use of the Semantic Differential when studying 
values, only those scales high on the evaluation fac- 
tor were utilized. In this study nine scales (good- 
bad, dirty-clean, nice-awful, unpleasant-pleasant, 
fair-unfair, worthless-valuable, happy-sad, dishonest- 
honest, and beautiful-ugly), and 18 concepts (de- 
mocracy, death, God, honor, communism, mother, 
success, love, life, murder, devil, father, myself, 
shame, failure, other people, hate, and suicide) were 
used. The concepts that were chosen reflect impor- 
tant people in one’s life, political systems, emotional 
states, behavioral acts, theological entities, etc. These 
kinds of concepts were selected on the assumption 
that they would tend to elicit strong evaluative re- 
actions in most people. It was felt that more imper- 
sonal concepts such as steel, apple, or skyscraper 
would not evoke measurable evaluative thinking. It 
is difficult to categorize steel as being good or bad, 
honest or dishonest, happy or sad, fair or unfair, etc. 
The concepts that were chosen had less veridical ob- 
jectivity and therefore should be open to greater 
value interpretation. 

Twelve of the concepts were organized into six 
semantically opposed pairs (God-devil, life-death, 
honor-shame, success-failure, love-hate, and democ- 
racy-communism) in order to test for the amount of 
perceived value difference between the members of 
the paired concepts. 


Scoring 


The scoring of a subject’s Semantic Differential for 
value extremeness was accomplished by assigning a 
score of 3 for extreme judgments like “very good” or 
“very bad,” a score of 2 for moderate judgments like 
“moderately beautiful” or “moderately ugly,” a score 
of 1 for judgments such as “mildly worthless” or 
“mildly valuable.” A score of 0 was assigned to the 
midpoint rating. The more extreme the judgment, the 
higher the score on the scale. The mean value ex- 
tremeness score per scale was recorded for the sub- 
ject. Each subject made 162 scale judgments. 

The scoring of a subject’s Semantic Differential for 
value differences was accomplished by comparing the 
rating differences between a pair of oppositional con- 
cepts scale by scale. The difference score per scale 
was the number of rating spaces separating the two 
judgments, including the scored spaces on the same 
scale. Complete opposition such as life being “very 
good” and death as being “very bad” received a score 
of 7. Complete identity received a score of 1. The 
greater the amount of disparity in scale ratings, the 
greater the value difference score. The mean value 
difference score per scale was recorded for the sub- 
ject. There were 54 scale difference scores available 
for each subject. The greater the value difference 
score, the greater the value disparity between the 
opposing concepts. 

Congruent with the hypothesis that suicidal indi- 
viduals evaluate more dichotomously than other peo- 


ple, it would be expected that the suicidal subjects in 
this study would have greater value extremeness and 
difference scores than the control subjects. 


Subjects 


The subjects were gathered from five Veterans Ad- 
ministration hospitals and one large metropolitan 
general hospital. Four of the hospitals were in the 
Los Angeles area and two were in the Missouri- 
Kansas area. Four of the hospitals were general medi- 
cal and surgical and two were neuropsychiatric. 

Three groups of 15 subjects each were utilized in 
the present study. The first of these groups was com- 
posed of individuals who had made a serious attempt 
at killing themselves (S group). The second group 
were subjects suffering from marked psychosomatic 
difficulties (PS group) and the last group was com- 
posed of normal hospitalized patients (N group). 

All the subjects were native-born, Caucasian males 
between the ages of 21 and 55. They were of normal 
intelligence as defined by the Information subtest 
of the Wechsler-Bellevue Intelligence Scale, Form I 
(Wechsler, 1944). (The S, PS, and N groups earned 
mean Information subtest scores of 16.4, 17.2, and 
17.6, respectively. The corresponding standard devia- 
tions were 4.1, 2.9, and 3.4. An analysis of variance 
was carried out and an F ratio of 1.46 was found 
which for 2 and 42 degrees of freedom was not sig- 
nificant at the .05 level of confidence.) None of the 
subjects were psychotic and they were all in good 
enough mental and physical shape to partake in the 
research. All the subjects were hospitalized at the 
time of the Semantic Differential administrations. The 
suicidal subjects were hospitalized because of their 
suicide attempts, and the psychosomatic subjects 
were in the hospital receiving treatment for their 
physical difficulties. Hospitalized normal patients were 
used as a control for the effects of hospitalization. 

None of the subjects received any kind of psy- 
chiatric or psychological treatment previous to par- 
taking in the research project. Great care was taken 
to select subjects who had had no electroshock, 
ataratic drugs, individual or group psychotherapy. 

Besides these general characteristics of the subjects, 
the suicidal subjects were chosen on the basis of their 
having made a bona fide suicidal attempt. The es- 
tablishment of the suicide attempt was made by (a) 
the verbal admission of the patient and (b) the pres- 
ence of some objective evidence of self-destruction 
such as a high barbiturate level in the blood, noxious 
chemical substances lavaged from the stomach, and 
deep surgical wounds on the body. In addition, the 
suicidal subjects carried a diagnosis of neurosis as 
defined by the American Psychiatric Association diag- 
nostic manual (1952). 

The psychosomatic subjects were categorized by 
(a) no history of suicidal threat or attempt and (6) 
a diagnosis of psychosomatic disability as defined by 
the above cited manual. 

The normal subjects were defined by (a) no evi- 
dence of suicidal threat or attempt and (6b) no evi- 
dence of emotional disturbance. The absence of emo- 
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tional difficulties was approximated with critical items 
from the Cornell selective indices (Mittlemann & 
Brodman, 1946). The normal patients were in the 
hospital only for medical and surgical problems of a 
transient nature, such as appendectomies, tonsil re- 
movals, and minor fractures. Patients that suffered 
from physical difficulties such as skin diseases or in- 
ternal gastric disorders were not utilized because of 
the possibility of unknown psychosomatic involve- 
ment. Patients who had remained in the hospital 
longer than 1 month were eliminated since there 
might be unknown psychological factors which were 
propedeutic to their staying in the hospital longer 
than prescribed by the nature of their physical dis- 
abilities. 


RESULTS 


The results of the study suggest that there 
is little difference between the suicidal and 
psychosomatic subjects as far as extreme Di- 
chotomous Evaluative Thinking is concerned. 
However, a significant difference was found 
between the two emotionally disturbed groups 
and the normal hospitalized subjects. 


Extremeness Scores 


The percentage of the different kinds of 
value extremeness responses made by the three 
groups of subjects is presented in Table 1. 
For the high value extremeness category 
(Number 3, indicating extreme judgments 
such as “very good, very bad,” etc.), the 
suicidal group earned the highest percentage 
of responses (71%), while the normal sub- 
jects earned the lowest percentage of responses 
(52%). This relationship is reversed for the 
low value extremeness category (Number 0, 
indicating neutral or equal judgments such as 
“equally good and bad, equally honest and 
dishonest,” etc.). For these modulated rat- 
ings, the normal group earned the highest per- 


TABLE 1 


PERCENTAGE OF RESPONSES MADE ON EAcH VALUE 
EXTREMENESS CATEGORY BY THE SUICIDAL, Psy- 
CHOSOMATIC, AND NORMAL SUBJECTS ON THE SE- 

MANTIC DIFFERENTIAL TEXT 


Extremeness Category 


S group 
PS group 
N group 


TABLE 2 

MEANS AND STANDARD DEVIATIONS OF THE VALU! 
EXTREMENESS SCORES PER SCALE FOR THE SUICIDAL, 
PSYCHOSOMATIC, AND NORMAL SUBJECTS 


1.93 
46 


centage (25%), and the lowest percentage 
(10%) was obtained by the suicidal subjects. 
The distribution of extremeness scores for the 
psychosomatic group lies closer to the suicidal 
group than to the normal group. The means 
and standard deviations of the extremeness 
score per scale are presented in Table 2. 

It appears that all the subjects used ex- 
treme value judgments and their judgments 
were distributed in roughly the same manner. 
But the suicidal and psychosomatic subjects 
made more extreme value judgments than the 
normal subjects. It was hypothesized that the 
suicidal subjects, if they used Dichotomous 
Evaluation Thinking to a greater degree than 
the control subjects, would assign more ex- 
treme value ratings to the Semantic Differ- 
ential scales than the control subjects. A sim- 
ple analysis of variance of the scale extreme- 
ness scores was computed and the summary 
is presented in Table 3. An F ratio of 6.62 
was found, which was significant at the .01 
level of confidence. The source of the differ- 
ences between the means was traced by the 
Tukey method (1949) and it was found that 
both the suicidal and psychosomatic groups 
earned significantly higher value extremeness 
scores than the normal group. There was no 
statistical difference between the two emo- 
tionally disturbed groups. 


TABLE 3 


ANALYSIS OF VARIANCE OF THE VALUE EXTREMENESS 
ScORE PER SCALE FOR THE SUICIDAL, Psycuoso- 
MATIC, AND NorMAL Groups 


Mean Square 
Between 1.06 
Within 16 
Total 


* Significant at the .01 level of confidence. 
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TABLE 4 


PERCENTAGE OF VALUE RESPONSES MADE oN EAcu 
DIFFERENCE CATEGORY BY THE SUICIDAL, Psycuo 
SOMATIC, AND NORMAL SUBJECTS ON THE SEMANTIC 
DIFFERENTIAL TEST 


Difference Category 


S group 61364 § 100 
PS group 26S 6 100 

7 


N group 26 10 8 267 6 1 


Difference Scores 


The percentage of scale value difference 
scores earned by the three groups of subjects 
is presented in Table 4. The left hand part of 
the table represents the more extreme value 
difference categories, while the right hand part 
of the table represents the less extreme value 
difference categories. The sucidal subjects 
placed 50% of their responses in the most 
extreme value difference category, while only 
26% of the normal subjects’ responses were 
found there. The least extreme value differ- 
ence category finds the normal group con- 
tributing 17% of their responses, while the 
suicidal group contributed 5% of their re- 
sponses. The psychosomatic group distribu- 
tion lies in between the suicidal and normal 
groups and is closer to the former. The means 
and standard deviations of the difference score 
per scale are presented in Table 5. 

From the tabular material, it can be seen 
that all the subjects made a great number of 
value differentiations between the opposing 
concepts. The only difference seems to lie in 
the extent of the differences. It was hypothe- 
sized that the suicidal subjects, if they used 
Dichotomous Evaluative Thinking to a greater 


TABLE 5 


MEANS AND STANDARD DEVIATIONS OF THE VALUE 
DIFFERENCE SCORES PER SCALE FOR THE SUICIDAL, 
PsYCHOSOMATIC, AND NORMAL SUBJECTS 


Measure SGroup PSGroup N Group 


Mean 5.59 5.03 4.34 
SD 80 80 1.00 


degree than the control subjects, would per- 
ceive greater value differences between the 
paired oppositional concepts than the control 
subjects. A simple analysis of variance was 
carried out on the scale difference scores and 
a summary of the analysis is presented in 
Table 6. An F ratio of 6.77, significant at the 
.01 level of confidence, was found. The source 
of the significant difference was traced by the 
Tukey method (1949) and it was found that 
the suicidal and psychosomatic groups earned 
significantly greater value difference scores 
than the normal group. There was no sta- 
tistical difference between the two emotion- 
ally disturbed groups. 


TABLE 6 


ANALYSIS OF VARIANCE OF THE VALUE DIFFERENCE 
ScoRE PER SCALE FOR THE SUICIDAL, PsycHoso- 
MATIC, AND NoRMAL GROUPS 


Source df Mean Square F 
Between 2 5.15 6.77* 
Within 42 76 
Total 


* Significant at the .01 level of confidence. 


DISCUSSION 


The evidence gathered in this study did 
not support the contention that Dichotomous 
Evaluative Thinking was primarily or solely 
a characteristic of the thinking of the suicidal 
individuals utilized in this study. The suicidal 
subjects tended to score their Semantic Dif- 
ferentials in such a manner as to reflect the 
presence of dichotomous evaluations. How- 
ever, both control groups showed this tend- 
ency, and the psychosomatic subjects showed 
just as many dichotomous evaluations as the 
suicidal subjects. None of the three groups 
of subjects demonstrated exclusive function- 
ing as far as Dichotomous Evaluative Think- 
ing was concerned. The differentiation among 
the three groups seemed to be the amount of 
Dichotomous Evaluative Thinking that the 
subjects used. 

It should be stressed that the findings of 
this study should not be generalized to other 
aspects of thinking, but should be restricted 
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to the cognitive organizations of values. It is 
not known whether emotionally disturbed in- 
dividuals utilize more dichotomous evaluations 
than normal people when dealing with im- 
personal objects such as “tin cans or soup 
spoons.” Neither is it known whether Di- 
chotomous Evaluative Thinking extends into 
areas of cognitive organization where values 
do not play a major role (e.g., syllogistic rea- 
soning, mathematics, etc.). 

Even though it has been suggested by Os- 
good (1957) that most everything is thought 
of in terms of some kind of value orientation, 
it cannot be said that Dichotomous Evalua- 
tive Thinking would have been found if scales 
high on the Activity and Potency factors were 
used in the present study. It may well be that 
steel is stronger or that carbonated water is 
“fizzier” for emotionally disturbed individuals 
in comparison to normal people, but data that 
would substantiate such a contention are not 
available. 

The concept of Dichotomous Evaluative 
Thinking should be restricted to objects of 
thought which are pregnant with personal 
meanings and therefore elicit strong value re- 
actions in most people. 

If, on the basis that self-destruction is a 
drastic solution to personal difficulties, it is 
assumed that the suicidal individuals were 
under the greatest amount of stress, and that 
the normal subjects had comparatively the 
lowest level of stress, then the results of this 
study might be compatible with the inter- 
pretation that stress causes Dichotomous 
Evaluative Thinking to take on pathologi- 
cal and nonadaptive characteristics. A cog- 
nitive mode of thinking that is normal un- 
der normal conditions becomes a pathological 
caricature when the person is under great 
psychic stress. It would appear that excessive 
utilization of Dichotomous Evaluative Think- 
ing is a common characteristic of neurosis 
and emotional disturbance. It is felt that sui- 
cide is not caused by Dichotomous Evaluative 
Thinking, but rather that its pathological and 
excessive utilization accompanies personality 
decompensation, one other manifestation of 
which might be suicide. 


SUMMARY 


The hypothesis that suicidal individuals 
think more dichotomously than other emo- 
tionally disturbed people and normal subjects 
was tested by the use of the Semantic Dif- 
ferential test. 

Two measures of Dichotomous Evaluative 
Thinking were devised. They were (a) a 
value extremeness score (extremeness of value 
judgments on the Semantic Differential scales) 
and (5) a value difference score (the amount 
of value difference perceived between two op- 
posing concepts per scale). 

Three groups of 15 subjects each were uti- 
lized. They were a group of individuals that 
attempted suicide, psychosomatic patients, 
and normal hospitalized patients. All the sub- 
jects were Caucasian native-born males. Each 
subject was administered a Semantic Differ- 
ential form consisting of 18 concepts with 
nine scales for each concept. 

The results indicate that for the value ex- 
tremeness and difference scores, the suicidal 
and psychosomatic subjects earned higher 
scores than the normal subjects. No statisti- 
cal differences were found between the sui- 
cidal and psychosomatic groups on the two 
types of scores. 

It was concluded that Dichotomous Evalua- 
tive Thinking seems to be a common charac- 
teristic of emotionally disturbed persons and 
was not an exclusive factor in the thinking 
of suicidal individuals. 
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EVALUATION OF PSYCHOTHERAPY AS AN ADJUNCT TO 


There are three main viewpoints prevalent 
today concerning the value of psychotherapy 
as an adjunct to the somatic therapies: (ca) 
some form of psychotherapy must be com- 
bined with somatic therapy if genuine im- 
provement of the patient is to occur (Palmer 
& Riepenhoff, 1950); (6) it is undesirable to 
combine psychotherapy with somatic therapies 
since somatic therapies may actually render 
the patient less accessible to psychotherapy 
(Frank, 1950; Rosen, 1953; Sullivan, 1940) ; 
and (c) psychotherapy contributes little or 
nothing to the treatment of the patient over 
and above such methods as insulin-coma ther- 
apy (Kalinowski & Hoch, 1952). 

The present study sought at least a partial 
answer to the following question: to what ex- 
tent does psychotherapy contribute to the im- 
provement of schizophrenic patients who are 
undergoing insulin-coma treatment? Studying 
the effects of psychotherapy in conjunction 
with insulin therapy and the total treatment 
program that is associated with it furnished 
an unusually rigid test of the value of psycho- 
therapy. In this case, the comparison was not 
between a group receiving psychotherapy and 
a group receiving no specific treatment, but 
it was between a group receiving insulin ther- 
apy as a part of a general treatment program 
and a group receiving this plus psychotherapy. 


METHOD 


Essentially, the research design consisted of com- 
paring the improvement made by two groups of 


1 This paper is based upon a doctoral dissertation 
done at the University of Texas, January 1955. The 
writer wishes to express his appreciation to Wayne 
Holtzman for his guidance and enthusiastic support. 
Recognition is also due the staff of the Waco Vet- 
erans Administration Hospital where this study was 
conducted. 
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schizophrenics undergoing deep insulin-coma therapy. 
The experimental group differed from the control 
group in only one variable; namely, the addition of 
individual and group psychotherapy to the total in- 
sulin treatment program. The progress made by the 
patients was assessed by diverse techniques judged 
to be objective, quantifiable, and meaningful. The 
judgments and ratings of improvement were con- 
ducted in such a way as to eliminate contamination 
of possible bias on the part of the investigators. The 
psychotherapy used with the experimental subjects 
was studied by means of recordings, therapists’ notes, 
and quantified therapists’ ratings. 


Subjects 


Over a 9-month period complete data were gath- 
ered on 19 experimental and 18 control cases. All 
cases were male veterans in the Veterans Adminis- 
tration Hospital, Waco, Texas. As patients were ad- 
mitted to the insulin service, they were randomly as- 
signed to the experimental and control groups. No 
significant differences were found between the two 
groups in age, education, or chronocity of neuropsy- 
chiatric condition. Pretreatment ratings on scales 
based on psychiatric interviews, observed ward be- 
havior, and psychological tests clearly demonstrated 
the presence of psychopathology in all patients prior 
to therapy.* 


Insulin Treatment Service 


The insulin service accommodated a maximum of 
16 patients so that each group consisted of no more 
than 8 patients at any given time. All the research 
patients lived on the same ward and were exposed 
to the same ward personnel, psychiatrists, social 
worker, and psychologists, as well as to the same 
general environment. They were housed in a special 
wing which provided treatment room, day room, 
and sleeping quarters. 

Typically, the patients underwent insulin coma 5 
days a week. In rare cases insulin coma was com- 
bined with electric shock treatment, or the comas 

2A detailed description of these subjects is avail- 
able in Roos (1955). 

3 The scales and rating procedures are described in 
a later section of this paper. 
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were interspersed with shock treatments.4 These 
modifications in treatment were determined by the 
psychiatrist who kept a very close watch on each 
patient. The overall treatment time was set at 10 
weeks by the psychiatrist so that all patients would 
have the same amount of somatic therapy and the 
same length of time on the intensive treatment 
service. 

Occupational therapy, recreational therapy, and so- 
cial service functions were available to all patients. 
For the duration of the research project, however, 
the social service worker limited himself to discussing 
superficial reality problems with the insulin patients 
in order to avoid possible contaminating effects. 


Evaluative Criteria 


Three types of evaluative measures were used: (a) 
scales based on a psychiatric interview and observed 
behavior on the ward, (b) scales based on psycho- 
logical test data, and (c) length of stay in the hos- 
pital following termination of the insulin coma treat- 
ment. 

Multidimensional Scale for Rating Psychiatric Pa- 
tients (MSRPP). The MSRPP is divided into two 
sections, one referring to a psychiatric interview and 
the other focusing upon ward behavior as judged by 
nurses and psychiatric aides (Lorr, 1953). Prelimi- 
nary studies utilizing this scale report satisfactory 
reliability and validity (Lorr, 1953; Lorr, Jenkins, & 
Holsopple, 1954). The scale yields an overall mor- 
bidity score indicative of degree of pathological 
variation from “normal” behavior. 

All those using the scales were carefully instructed 
in the proper rating procedure. Common rating er- 
rors, such as halo effect, central tendency and posi- 
tion effect, were discussed in group meetings. The psy- 
chiatrist in charge of the insulin service was re- 
quested to use the MSRPP Interview scales to record 
his impressions of the patient based upon an inter- 
view prior to onset of insulin treatment and again 
ten weeks later when treatment was terminated. The 
psychiatrist was in close contact with each patient 
throughout his treatment as well as for several weeks 
prior to its initiation. He was unaware, however, of 
which patients received psychotherapy and was care- 
ful not to elicit material from them which might re- 
veal this information to him, thus preventing the 
possibility of unconscious bias influencing his judg- 
ments. 

The MSRPP Ward Observation scales were pre- 
sented to the nurses, aides, occupational therapists, 
and recreational workers, and they were asked to 
rate each patient biweekly. These workers had close 
daily contact with the patients so that these ratings 
should be revealing of the patients’ behavior in so- 
cial situations and on the ward. These raters were 
given the impression that the research was concerned 
only with intensive study of patients undergoing in- 
sulin treatment. Thus, independent behavioral ratings 


4 The treatment data on each patient are summa- 
rized in Roos (1955). 
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obtained from the ward personnel provided unbiased 
criteria of therapeutic success. 

Psychological Tests. Each patient was administered 
a battery of psychological tests before the beginning 
of treatment and again upon its completion. These 
tests were given by a psychologist other than the 
experimenter, who was unaware of whether a given 
patient was in the control or experimental group. 
The battery included the following group-adminis- 
tered tests: (a) the Grayson Perceptualization test, 
(b) the Kent EGY intelligence test, (c) the Human 
Figure Drawing Test, (d) the Bender-Gestalt, (e) a 
sentence completion test, and (f) a modification of 
the Wechsler-Bellevue Picture Completion Test. All 
research patients were also given the Rorschach test 
before and after treatment. This test was adminis- 
tered individually with a standard inquiry following 
the technique outlined by Klopfer and Kelley (1946). 
Each protocol was then scored, the psychogram was 
plotted, and the main ratios computed. 

Test interpretation scales intended to quantify sig- 
nificant aspects of personality which might be ex- 
pected to show changes as a result of psychotherapy 
were devised by the investigator. Two sets of rating 
scales were developed. One set, consisting of five 
steps, was designed to assess the patient’s function- 
ing in certain areas at the time of testing. The other 
set, consisting of seven steps, was aimed at estimat- 
ing degree of change taking place during a lapse of 
time in the same areas of functioning. The first set, 
referred to as Cross-Sectional Rating scales, was ap- 
plicable to a single test. The second set, referred to 
as Comparison Rating scales, was applicable to com- 
bined pre-posttests. The names of the seven scales 
follow: 5 Intellectual Efficiency, Emotional Function- 
ing, Psychopathological Characteristics, Interpersonal 
Efficiency (Nature of Social Contacts), Interpersonal 
Efficiency (Need Gratification), Self-Attitudes, and 
Psychosexual Level. 

Three clinical psychologists served as raters; all 
had considerable experience in psychodiagnostics. The 
scales were discussed in detail with them. Only after 
the scales were clearly understood and considered 
unambiguous by all raters were they applied to the 
research cases. 

Cross-Sectional Ratings. All identifying data were 
removed from the test items as well as any infor- 
mation pertaining to whether the given material was 
obtained before or after treatment. The group tests 
were treated as one unit, and the Rorschach was 
treated as a separate unit. The raters were first pre- 
sented with a set of group tests from 10 patients, or 
a set of 10 Rorschachs, consisting of pre- and post- 
tests and both control and experimental cases. They 
then rated each Rorschach and each set of group 
tests (which were treated independently and without 
knowledge of which group tests belonged with which 
Rorschach) on the Cross-Sectional Rating scales. 
Following this procedure, which was repeated with 
several sets of test protocols for 10 patients at a 

5 The actual operational definitions for each step 
on the scales can be found in Roos (1955). 
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time, each rater was handed a set of 10 Rorschachs 
paired with the corresponding group tests. Again the 
raters were unaware of whether an individual set of 
tests had been gathered before or after treatment or 
whether it belonged to an experimental or control 
patient. 

Comparison Ratings. The next step in the rating 
procedure consisted of giving each rater a set of 10 
pairs of Rorschachs or 10 pairs of group tests, each 
pair consisting of the pretreatment and posttreatment 
protocols for a single patient. He was told which of 
each pair was pre- and which was posttreatment and 
was requested to complete the Comparison Rating 
scales. As in the previous rating sessions, each set of 
10 contained both experimental and control cases, 
selected randomly. 

Finally, after evaluating several sets of pre-, post- 
Rorschachs and group tests, each rater was handed 
a set of 10 complete pairs, each pair consisting of 
the pretreatment Rorschach and corresponding group 
tests and the posttreatment Rorschach and group 
tests of the same patient. The rater was told which 
were the pre- and which were the posttreatment tests 
and was requested to complete Comparison Rating 
scales on the basis of the entire test battery. 


Psychotherapy 


The experimental group met three times a week 
for group psychotherapy sessions conducted by the 
experimenter and a staff psychologist. Each of the 
therapists met the group alone once a week, and on 
the third day both acted as cotherapists. In addition, 
the patients were seen for individual psychotherapy 
sessions by the experimenter and staff psychologist. 
They were assigned randomly to each of the psycho- 
therapists for these individual sessions. The time and 
number of individual sessions per patient were deter- 
mined in part by the therapist’s judgment and in 
part on the basis of the patients’ requests for indi- 
vidual sessions. On the average, two weekly indi- 
vidual meetings were held. All experimental patients 
were told they could request individual sessions as 
their needs arose. 

The therapists made notes on each patient’s be- 
havior immediately following every therapy session 
(group as well as individual), and each group ses- 
sion was briefly summarized in terms of basic themes 
and tensions. Many of the sessions, both individual 
and group, were recorded and filed.® 


RESULTS AND DISCUSSION 


The experimental design of this study was 
aimed primarily at furnishing at least a par- 
tial answer to the question: can the effects of 
short-term psychotherapy used as an adjunct 
to insulin-coma treatment be measured over 
and above the effects of insulin-coma therapy 
alone? By comparing the changes in relevant 


6Sample transcriptions may be found in Roos 
(1955). 


personality variables between pretreatment 
and posttreatment evaluations of the experi- 
mental patients with the changes found in the 
control patients, it should be possible to de- 
termine whether the group receiving psycho- 
therapy in addition to insulin improved sig- 
nificantly more than the group receiving in- 
sulin alone. 


Analysis of Measures of Personality Used as 
Criteria for Evaluation of the Effects of Psy- 
chotherapy 


Two kinds of criteria are available for 
evaluating the effects of psychotherapy: (a) 
ratings on a number of personality traits made 
by clinical psychologists after careful study 
of individual protocols of psychological tests 
administered before and after treatment, and 
() changes in ward behavior and psychiatric 
interview as quantified by scores on the 
MSRPP. 

Psychological Test Ratings. The psycho- 
logical test scales were analyzed with respect 
to two main factors: (a) the degree of agree- 
ment between two independent raters on each 
scale, and (5) significance of changes indi- 
cated between pre- and posttreatment testing. 
When the interrater agreement for both scales 
involved in a pre-post treatment or intergroup 
comparison was found to be at least .50, the 
separate ratings of each psychologist were 
pooled and averaged. When one or both inter- 
rater correlations were less than .50, the rat- 
ings showing the greatest variability as indi- 
cated by the largest standard deviation were 
used as criterion measures of personality. 

The cross-sectional ratings produced lower 
and more inconsistent reliability than the 
comparison ratings. In general, the degree of 
agreement between the two raters was better 
than chance on both cross-sectional and com- 
parison ratings, although a few correlations 
were near zero or negative. Comparison of the 
number of significant correlations between 
raters judging on the basis of group tests 
alone, Rorschach alone, or a combination of 
group tests and Rorschach, revealed little dif- 
ference. Thus, it seems that increasing the 
amount of available data on which to base 
ratings did not increase the degree of agree- 
ment between raters. This finding raises an 
important question for further research, since 
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many Clinicians believe that personality evalu- 
ations based on a comprehensive battery of 
tests are more reliable and valid than those 
derived from single tests. 

Comparison of the mean pre- and posttreat- 
ment scores obtained from the experimental 
and control groups on each of the Cross-Sec- 
tional Rating scales revealed marked differ- 
ences in the ratings and what they signify 
from one set of tests to the next (e.g., group 
tests alone, Rorschach alone, and group tests 
combined with Rorschach). From a total of 
nine significant (.05 level or beyond) differ- 
ences between pre- and posttests, eight were 
derived from ratings based on the group tests 
alone and one was derived from ratings based 
on the group tests and Rorschach combined. 
It appears from these data, therefore, that the 
group tests were quite sensitive to changes oc- 
curring during treatment and that ratings de- 
rived from them may be of value for inter- 
group comparisons. On the other hand, the 
Rorschach tests—or at least judgments de- 
rived from them by the raters in this study— 
did not significantly reveal these changes. One 
possible interpretation of these results is that 
the personality characteristics measured by 
the Rorschach are “deep” enduring genotypi- 
cal traits which seemingly remain unchanged 
as a result of treatment. Another more pos- 
sible interpretation, however, is that the clini- 
cal judgments based on the Rorschach do not 
reflect the personality characteristics in ques- 
tion. 

Comparison ratings on all scales, irrespec- 
tive of the data upon which they were based, 
indicated improvement, suggesting that they 
are suitable for comparing the experimental 
and control groups. These results seem to con- 
flict with those derived from the cross-sec- 
tional ratings which indicated that only those 
ratings based on the group tests revealed im- 
provement. In all probability, the knowledge 
of which data were pretreatment and which 
were gathered after treatment greatly influ- 
enced the raters who believed, in general, that 
most patients were improving. Rater bias 
should affect the experimental and control 
groups equally, however, since the raters had 
no clues as to which protocols were obtained 
from the experimental group and which were 
obtained from the control group. 


MSRPP Ratings 


A measure of psychopathology was derived 
from the MSRPP by combining the ratings 
based on the psychiatric interview with the 
average of the four ratings made by the ward 
personnel. Comparison of pre- and posttreat- 
ment ratings revealed a significant decrease 
in pathology as measured by the MSRPP (be- 
yond .001 level). These results, however, do 
not permit any generalizations regarding in- 
sulin-coma therapy, since the present study 
was not designed to evaluate this type of 
treatment. 


Comparison between Experimental and Con- 
trol Groups 


On the basis of the preceding analysis of 
measures used in this study to assess person- 
ality changes, the following types of data were 
found to reveal changes between pre- and 
posttreatment evaluations: (a) the cross-sec- 
tional ratings based on the group tests, (5) 
the comparison ratings based on all cate- 
gories of psychological test data, and (c) the 
MSRPP measure of pathology derived from 
psychiatric interview and observed ward be- 
havior. These measures are suitable, there- 
fore, for testing the hypothesis that the ex- 
perimental group improved more than did the 
control group. 

Psychological Test Ratings. The mean pre- 
posttreatment changes on the cross-sectional 
ratings for the experimental and control 
groups, together with the differences in these 
changes between the two groups and their 
statistical significance, are summarized in 
Table 1. Since both control and experimental 
patients showed some improvement, only the 
cross-sectional ratings for which there exist 
significant differences between the pre- and 
posttreatment testing for either the control or 
the experimental group are included in this 
analysis. 

Consideration of these data reveals that all 
differences in improvement between the ex- 
perimental and control groups favor the ex- 
perimental group; that is, the ratings on those 
scales which differentiated significantly be- 
tween pre- and posttreatment testing reveal 
more improvement in the experimental than 
in the control group. Ratings based on the 
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TABLE 1 


COMPARISON OF EXPERIMENTAL AND CONTROL Groups ON Cross-SECTIONAL RATING SCALES 


Experimental Group 
Mean Change 
Rating Scale Score 


Group tests: 


Intellectual Efficiency 1.11 
Emotional Functioning 1.00 
Psychopathological Characteristics 1.42 


Interpersonal Efficiency — 
Nature of Social Contacts 1.16 
Need Gratification 


Self-Attitudes 


Group tests and Rorschach combined: 


Interpersonal Efficiency — 
Nature of social contacts 0.37 


here. 


Control Group 
Mean Change 
Score Difference* t 


0.69 0.42 1.36 
0.25 0.75 2.54* 
0.31 1.11 4.20*** 
0.37 0.78 2.94** 


0.13 0.23 0.96 


Note.—Only scales where comparison of pre- and posttreatment ratings gave significant t's (beyond .05 level) are presented 


® Positive values indicate differences favoring experimental group. 


* Indicates significance beyond .05 level. 
** Indicates significance beyond .01 .evel. 
*** Indicates significance beyond .001 level. 


group tests for three scales—Emotional Func- 
tioning, Psychopathological Characteristics, 
and Nature of Social Contacts—reached an 
acceptable measure of statistical significance 
(0.05, 0.001, and 0.01 level, respectively). 
Ratings on Need Gratification approached 
significance (beyond 0.10 level). 

These findings lend strong support to the 
hypothesis that short-term psychotherapy used 
as an adjunct to insulin-coma therapy leads 
to measurable improvement in schizophrenics 
over and above the changes resulting from in- 
sulin-coma treatment alone. If the psycholo- 
gists’ ratings on these three scales actually re- 
flect the traits as defined, the results indicate 
that short-term psychotherapy helped schizo- 
phrenic patients to learn more mature ways 
of handling their emotional reactions, to aban- 
don primitive psychotic thinking, and to im- 
prove their ability to function well in social 
situations. These are the kinds of changes 
which most psychotherapists would probably 
expect to find if psychotherapy were at all 
successful. 

Although none of the differences between 
the experimental and control groups on the 
comparison ratings reached statistical signifi- 


cance, all the ratings based on the group tests 
alone favored the experimental group, whereas 
ratings based on the Rorschach tended to fa- 
vor neither group consistently. 

MSRPP Ratings. Comparisons of the ex- 
perimental group and the control group with 
respect to improvement as reflected by changes 
from the MSRPP revealed that the experi- 
mental group showed a greater decrease in 
pathology than the control group. The ¢ test 
failed to reach statistical significance, how- 
ever, suggesting there is only a trend favoring 
the hypothesis that the experimental group 
made more progress than the control group. 

Post-insulin Hospitalization. Since the 
MSRPP ratings were made shortly following 
termination of insulin therapy, the tempo- 
rary leveling effects of the insulin may have 
overshadowed group differences which would 
become evident after the immediate effects of 
the insulin had dissipated. A follow-up study 
of these and similar cases might reveal more 
striking differences between the experimental 
and control groups than those found in this 
study which compared the groups immedi- 
ately following treatment. Preliminary fol- 
low-up data based on a survey made 2 months 
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following the present study lend support to 
this hypothesis. At that time only three of 
the experimental patients had not been either 
discharged from the hospital or released on 
trial visit as compared with eight of the con- 
trol patients. The average number of days 
an experimental patient remained hospitalized 
following the termination of insulin treat- 
ment was 38.4 as compared with 72.2 days 
for the control patient. Since the distribution 
of numbers of days was markedly skewed, a 
nonparametric method—the Mann-Whitney U 
test—was used to determine the significance 
of the difference between the two groups of 
patients. A U of 1.94 (p less than 0.05) was 
obtained, from which it can be concluded that 
insulin patients receiving psychotherapy are 
more likely to leave the hospital earlier than 
are those who receive insulin alone. This find- 
ing seems particularly important in view of 
the fact that the psychiatrist responsible for 
discharging the patients was unaware of 
whether any given patient had received psy- 
chotherapy. 


SUMMARY 


The present study was an attempt to test 
the hypothesis that short-term psychotherapy 


used as an adjunct to insulin-coma therapy 
results in measurable improvement in schizo- 
phrenics over and above that derived from in- 
sulin treatment alone. A control group of 18 
schizophrenics undergoing insulin-coma ther- 
apy was compared with an experimental 
group of 19 similar patients who received a 
combination of group and individual psycho- 
therapy in addition to the insulin treatment. 

Evaluative criteria included: (a) pre- and 
posttreatment ratings by a psychiatrist on 
the basis of an intensive interview using the 
MSRPP, (4) biweekly ratings made by ward 
personnel using the Ward Observation scale 
of the MSRPP, and (c) pre- and posttreat- 
ment ratings made on the basis of individu- 
ally-administered Rorschachs and a battery 
of group-administered psychological tests. All 
ratings were made in such a way as to be free 
from possible rater bias. 

Comparison of the improvement made by 
the experimental and control groups led to the 
following major findings: (a) the objective 
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psychiatric and behavioral ratings revealed 
differences between the improvement made 
by the two groups which consistently favored 
the experimental group, but these trends did 
not reach statistical significance; (5) the psy- 
chological test ratings, which had differenti- 
ated between pre- and posttreatment testing, 
revealed significantly greater improvement in 
the experimental group than in the control 
group on three important scales; (c) ratings 
based on comparisons between pre- and post- 
treatment protocols failed to differentiate be- 
tween the mean improvements made by the 
two groups; and (d) comparison of the two 
groups on length of time patients remained in 
the hospital following termination of insulin 
treatment revealed that the experimental pa- 
tients—on the average—left the hospital more 
than a month sooner than the control pa- 
tients, and this difference proved to be sta- 
tistically significant. These findings were in- 
terpreted as verifying the major hypothesis. 
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A NOTE ON THE SIGNIFICANCE OF DISCREPANCIES 
BETWEEN GOODENOUGH AND BINET IQ SCORES 


ELISE ELKINS LESSING 


Illinois Institute for Juvenile Research 


Several investigators have tried to ascertain 
the significance of discrepancies between in- 
tellectual ratings derived from the Good- 
enough Draw-A-Man Test on the one hand 
and standard intelligence tests on the other. 
It has been suggested that a negative devia- 
tion of the Goodenough IQ from other rat- 
ings might be an indication of either brain 
damage or emotional maladjustment (Good- 
enough, 1950). Hinrichs (1935) and Brill 
(1937) found support for this hypothesis in 
their comparisons of Binet and Goodenough 
IQ scores in populations of delinquents and 
defectives, respectively. Hanvik (1953) com- 
pared the WISC and the Goodenough IQs of 
25 patients in a child guidance clinic. He 
noted that the mean WISC Full Scale IQ for 
the group was 13.72 points higher than the 
mean Goodenough IQ score. In only 4 out of 
25 cases did a child’s Goodenough IQ exceed 
his own WISC Full Scale IQ score. He sug- 
gested that while the Goodenough might not 
furnish a valid assessment of the intellectual 
ability of a clinic population, it might pro- 
vide an index of neuroticism. 

In a pilot study for a larger research proj- 
ect, the present author obtained evidence that 
the discrepancy between Goodenough IQs and 
IQs on a standard test of intelligence can be 
as dramatically large in a nonclinic popula- 
tion as in the disturbed groups studied by 
Hinrichs and Hanvik. The author had access 
to male figure drawings made by 21 boys and 
2 girls who were described as well-adjusted 
by their teachers in the Chicago public 
schools. These children were part of a larger 
sample used in an Institute study of normal 
8 and 9 year old children. One experienced 


examiner administered Form L of the Re- 
vised Stanford-Binet to all the children and 
scored the protocols. The same examiner ad- 
ministered the drawing test. The word “per- 
son” was substituted for “man” in the Good- 
enough instructions; however, the encourage- 
ment to do the best job possible was retained. 
The drawings were scored by the author and 
a trained student worker. A reliability coeffi- 
cient of .95 was obtained. The student’s scor- 
ing was used as the basis for all subsequent 
computations. 

The mean Binet IQ of the 23 nonclinic chil- 
dren was found to be 119.96 with an SD of 
14.84. The mean Goodenough IQ of the group 
was found to be 92.17 with an SD of 17.26. 
The discrepancy of 27.79 IQ points between 
the mean Binet and the mean Goodenough 
IQ of this nonclinic group is actually larger 
than the discrepancy of 13.72 points which 
Hanvik found between the WISC and Good- 
enough IQs of his clinic sample. In the non- 
clinic group being described, the correlation 
between the Binet and Goodenough IQ scores 
was .51, which is significantly different from 
zero at only the .05 level. 

Thus, the situation is somewhat less grati- 
fying than Hanvik’s discussion would indi- 
cate. The Goodenough has questionable va- 
lidity as a measure of the intellectual level of 
both nonclinic and clinic groups. One must be 
extremely cautious in interpreting a lower 
Goodenough than WISC or Binet IQ as evi- 
dence of emotional disturbance or brain dam- 
age: the same pattern of test score relation- 
ships can be found in populations presumably 
characterized by neither of these pathological 
conditions. 
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EDWARD ZIGLER 


Yale University 


This study investigated how individuals in each 
of the major groups of functional disorders dif- 
fered from the population at large and from one 
another on the variables of age, intelligence, edu- 
cation, occupation, employment history, and mari- 
tal status. The study was based on an examina- 
tion of the case history data of 793 patients 
admitted to Worcester State Hospital during a 
12-year period (1945-1957) and referred to the 
hospital Psychology Department for appraisal. 
The diagnosis ascribed to each patient was that 
psychiatric classification agreed upon at a diag- 
nostic staff conference. The patients were cate- 
gorized into four major diagnostic groups: manic- 
depressive (37 men, 38 women); schizophrenic 
(165 men, 122 women); psychoneurotic (31 men, 
71 women); and character disorder (197 men, 
82 women). The 1950 census data for the state 
of Massachusetts was employed to compare the 
total hospital population and the specific diag- 
nostic groups to the population at large on the 
variables investigated. 

An effort was made to determine whether any 
systematic differences on the biographical vari- 
ables existed between the population employed in 
the study and a sample of 111 patients who had 
not been referred for psychological appraisal. 
This latter sample was matched with the present 


1 An extended report ef this study may be obtained 
without charge from Leslie Phillips (Worcester State 
Hospital; Worcester, Massachusetts) or for a fee 
from the American Documentation Institute. Order 
Document No. 6829 from ADI Auxiliary Publica- 
tions Project, Photoduplication Service, Library of 
Congress; Washington 25, D. C., remitting in ad- 
vance $2.00 for microfilm or $3.75 for photocopies. 
Make checks payable to: Chief, Photoduplication 


Service, Library of Congress. 

This investigation was supported by the Dementia 
Praecox Research Project, Worcester State Hospital, 
and a research grant (M896) from the National In- 
stitute of Mental Health, United States Public Health 
Service. 
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CASE HISTORY DATA AND PSYCHIATRIC DIAGNOSIS * 


LESLIE PHILLIPS 


Worcester State Hospital 


population on the variables of sex, year admitted 
to the hospital, and diagnosis received. This com- 
parison revealed that there were no differences 
between the two groups on the variables of oc- 
cupation, education, or employment history. On 
the age variable, it was found that the popula- 
tion used in this study is younger than the gen- 
eral hospital sample. In the marital status vari- 
able, it was found that patients who are single 
are equally represented in both samples but that 
married patients are under-represented while in- 
dividuals who fall into the “other” category are 
over-represented in the population used in this 
study. No comparisons could be made on the in- 
telligence variable. These differences should be 
kept in mind in generalizing the findings of the 
present study to a random population of patients 
suffering from functional disorders. 

The most striking finding of this study is that 
the hospital population and the diagnostic groups 
that comprise it are not representative of the 
population at large. In general, hospitalized indi- 
viduals are drawn from that portion of the popu- 
lation which has the most difficulty in meeting 
social expectancies. 

The results also indicate that the individual 
diagnostic groups differ from the population at 
large and from one another in rather specific 
ways. The distributions of the manic-depressive 
and psychoneurotic groups on the biographical 
variables display many similarities, while they 
differ markedly from the distributions of the 
schizophrenic and character disorder groups. The 
distributions of the schizophrenic and character 
disorder groups also display considerable simi- 
larity to one another. In general, the distribu- 
tions of the manic-depressive and psychoneurotic 
groups more nearly resemble the distribution of 
the normal population than do the distributions of 
the schizophrenic and character disorder groups. 


(Received November 23, 1960) 
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HOMOSEXUAL PREJUDICE AND PERCEPTUAL DEFENSE * 


LOUIS BREGER ann SHEPHARD LIVERANT 
Ohio State University 


In accordance with Adorno, Frenkel-Brunswik, 
Levinson, and Sanford’s (1950) conceptualization 
of prejudice (i.e., prejudice serves as a defense 
against one’s own unacceptable impulses) we pre- 
dicted that individuals scoring high on a measure 
of homosexual prejudice would show greater in- 
dices of threat to homosexual words presented in 
a perceptual defense situation. 

On the basis of their scores on a specially con- 
structed Likert-type scale of manifest attitudes 
toward homosexuality (H scale) male subjects 
were divided into a high prejudice group (N 
= 21); a median group (V=25) and a low 
group (V = 22). Following individual administra- 
tion of the H scale each subject was placed in a 
perceptual defense situation utilizing the succes- 
sive carbon method of presentation. The final 
measure of threat was obtained by comparing 
the means of each subject on four homosexual, 
four sexual, and eight neutral words. 

Perceptual defense of the avoidance type was 
demonstrated on the homosexual and sexual words 
for the total group (tf = 5.65, p< .001 between 
neutral and homosexual words; t = 7.50. p < .001 
between neutral and sexual words). However, a 
significant difference was not found between the 
homosexual and sexual words. 

The differential reaction times to the three 
groups of words by the high, middle, and low H 
scale scorers indicated that group differences in 


1 An extended report of this study may be obtained 
without charge from Shephard Liverant (Department 
of Psychology, Ohio State University; Columbus, 
Ohio) or for a fee from the American Documenta- 
tion Institute. Order Document No. 6830 from ADI 
Auxiliary Publications Project, Photoduplication 
Service, Library of Congress; Washington 25, D. C., 
remitting in advance $1.25 for microfilm or $1.25 
for photocopies. Make checks payable to: Chief, 
Photoduplication Service, Library of Congress. 


defensiveness to the homosexual words of either 
the avoidance or vigilance type were mot mani- 
fested. 

These results consistently fail to support the 
major hypothesis concerning prejudice and de- 
fensiveness. However, assuming the adequacy of 
our measures, an alternative explanation presents 
itself. The adequacy of the perceptual defense 
test as an index of threat tends to be substan- 
tiated by the significantly longer reaction times 
on the taboo words. The high test-retest reliabil- 
ity coefficient (Pearson r = .91, N = 43), the re- 
sults of an item analysis, and the successful con- 
trol of an acquiescence set all indicate that the 
H scale is measuring some variable in a meaning- 
ful manner and a consideration of the undis- 
guised nature of the items makes it appear likely 
that H scale scores do reflect manifest attitudes 
toward homosexuality. 

The alternative explanation, namely that the 
attitudes reflected in the H scale are for most 
subjects learned stereotypes, appears to account 
for the obtained results. The negative results with 
the perceptual defense measure are not unex- 
pected, since the notion of learned stereotypes 
involves no assumptions regarding homosexual 
prejudice as a defense against repressed impulses. 
Further corroboration of the stereotype hypothe- 
sis is provided by the finding that fraternity mem- 
bers and applied majors score significantly higher 
on the H scale than nonfraternity men and liberal 
arts majors, since the former groups would be 
more likely to hold stereotyped attitudes of all 
kinds. 
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The purpose of the present study was to cross- 
validate Liverant’s (1959) earlier findings from 
which he concluded that the obtained differences 
in mean MMPI profiles 


strongly supported the general clinical impression 
that both fathers and mothers of disturbed children 
are themselves more maladjusted than the fathers 
and mothers of nondisturbed children (p. 260). 


The MMPI profiles of 50 pairs of parents of 
disturbed children seen at the University of 
Iowa's Child Psychiatry Service were statistically 
compared by ¢ tests with those of a matched con- 
trol group of parents of nondisturbed Iowa chil- 
dren. While the experimental group means were 
generally higher than those of the control group, 
the differences were only significant (p< .05) 
for the D, Hy, and Pa scales for the mothers and 
for the Hs and D scales for the fathers. 

Twenty-three of the 50 experimental fathers 
and 13 of the 50 control fathers had one or more 
T scale scores at or above 70 (the usual cutting 
score’ indicating pathology) as did 26 of the 50 
experimental mothers and 12 of the 50 control 
mothers. The chi square analyses indicated that 
the obtained differences between the two groups 
were statistically significant (p < .05). 

The experimental group was then divided into 
four diagnostic subgroups based upon the pre- 
dominant symptoms involved in the child’s dis- 
turbance: 

1. Schizophrenic group (S)—children who were 
diagnosed as schizophrenic, autistic, or severely 
schizoid (V = 11). 


1 An extended report of this study may be obtained 
without charge from Leonard D. Goodstein (Depart- 
ment of Psychology, University of Iowa; Iowa City, 
Iowa) or for a fee from the American Documenta- 
tion Institute. Order Document No. 6831 from ADI 
Auxiliary Publications Project, Photoduplication 
Service, Library of Congress; Washington 25, D. C., 
remitting in advance $1.25 for microfilm or $1.25 
for photocopies. Make checks payable to: Chief, 
Photoduplication Service, Library of Congress. 


A FURTHER STUDY OF MMPI DIFFERENCES BETWEEN PARENTS 
OF DISTURBED AND NONDISTURBED CHILDREN * 


LEONARD D. GOODSTEIN ano VINTON N. ROWLEY 


University of Iowa 
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2. Acting Out group (AO)—children who ex- 
hibited acting out, delinquent, or antisocial be- 
havior (VW = 17). 

3. Personality Trait group (PT)—children 
diagnosed as personality trait disturbances, in- 
cluding adjustment reactions of childhood, school 
problems, and enuresis (VN = 13). 

4. Neurotic group (N)—children diagnosed as 
neurotic, including depressions, phobias, and anx- 
iety states (V =9). 

The mean MMPI scale scores for the parents 
within each of these four diagnostic subgroups 
were compared by ¢ tests with those from each 
of the other subgroups, separately by sex. While 
the means of the fathers of PT and N children 
were generally higher than those of the other 
two subgroups, none of the 54 ¢ tests was sig- 
nificant. The means of the mothers of the AO 
children were. in general, the highest with those 
of the other three groups lower. Ten of the 54 
comparisons involving the mothers, however, were 
significant with AO > PT on D, Hy, and Pd, 
AO>WN on Pd, N > PT on D and Hy, PT>N 
on Ma, PT >S on Mf,S > PT on Pd, andS >N 
on Pd (for all, p < .05). 

In general, the results of the present study sup- 
port the earlier conclusions of Liverant (1959) 
that parents of disturbed children are themselves 
more maladjusted than parents of nondisturbed 
children, although they are mot themselves as 
disturbed as persons who seek help for their own 
psychological problems. Although the mean pro- 
files of Liverant’s experimental groups, especially 
the fathers, were somewhat higher than those ob- 
tained in the present investigation, the pattern of 
the results in these two studies appears quite 
similar. 
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PREDICTING IMPROVEMENT OF PSYCHIATRIC PATIENTS 
FROM EARLY WARD SOCIALIZING RATINGS 


GRACE PENNINGTON SURBER 
New Hampshire State Hospital 


This study investigated the hypothesis that 
nonstructured ward socializing behavior during 
the first week following admission to a state hos- 
pital is positively correlated with improvement 
over a 9-week period of hospitalization. 

Forty-two female psychiatric patients who had 
been committed to a state hospital for the first 
time served as subjects. They ranged in age from 
20 to 59 with a mean age of 39, and were se- 
lected from consecutive admissions of functional 
disorders. 

A rating scale (SWIS) provided five classifica- 
tions of socializing behavior ranging from out- 
going to isolate behavior. Within 48 hours fol- 
lowing admission, two observers rated each sub- 
ject for 1 minute, six times daily for 5 days. 
Inter-rater reliability was rho = .99. A subject’s 
final socializing score was expressed as the me- 
dian of her ratings. 

Achievement of the following six hospital privi- 
leges made up the criteria of improvement: off 
ward, on grounds, off grounds, transfer to con- 
valescent ward, weekend home visit, convales- 
cent status. The time limit for meeting each cri- 
terion was set at the hospital’s average for that 
particular privilege. 


1 An extended report of this study may be obtained 
without charge from Grace Pennington Surber (Psy- 
chology Department, New Hampshire State Hos- 
pital; Concord, New Hampshire) or for a fee from 
the American Documentation Institute. Order Docu- 
ment No. 6832 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress; Washington 25, D. C., remitting in advance 
$1.25 for microfilm or $1.25 for photocopies. Make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 


The population was divided into two groups 
according to socializing behavioral scores. Scores 
above and below the midpoint of the rating scale, 
indicating higher and lower levels of socializing 
behavior, were compared with the number of sub- 
jects who met or did not meet each criterion. 
Scores of those who succeeded or failed to achieve 
the various criteria were tested for overlap. 

The data supported the hypothesis that first 
week time-sampling behavioral ratings could be 
used to predict individual, as well as group, rate 
of improvement. With Fisher’s exact probability 
test for subjects who showed higher or lower 
levels of socializing behavior, significant differ- 
ences in time of achieving the criteria were found 
at or beyond the .02 level for five privileges and 
.O5 for one privilege. The small overlap in be- 
havioral ratings between groups who had, and 
had not, met each criterion of improvement 
within the specified time limits, provided cutoff 
scores which could have permitted individual pre- 
diction. 

The results add statistical strength to previous 
studies which stress the importance of social proc- 
esses in the recovery of psychotic patients. They 
also suggest application of the method for spe- 
cific treatments, as well as for further research. 
While the study has not identified any causal fac- 
tors, it has shown a clear relationship between 
patient interactions and prognosis. It would seem, 
therefore, important not to overlook the possi- 
bility that patient interactional activity might be 
therapeutic in and of itself. 


(Received December 29, 1960) 
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Previous factor analytic studies of the 10 scales 
of the Guilford-Zimmerman Temperament Sur- 
vey have shown that the Restraint (R) and 
Thoughtfulness (T) scales have an intercorrela- 
tion of approximately .40 and that these scales 
apparently define one of the four second-order 
factors in the GZTS: the factor of extraversion- 
introversion (EI). In order to develop a short 
and factorily saturated EI scale for personality 
research the 60 R and T scale items were sub- 
jected to four overlapping factor analyses to de- 
termine the factor loadings of each item. Within 
each analysis the items were intercorrelated using 
the phi coefficient, two centroid factors were ex- 
tracted using the complete centroid method, and 
the loadings were analytically rotated to oblique 
simple structure by the oblimax criterion.? Three 
groups of college subjects were involved in the 
analyses: Group I consisted of 300 male fresh- 
men and Group II of 130 male freshmen who 
completed the full length GZTS inventory, while 
Group III was composed of 145 subjects of both 
sexes enrolled in introductory psychology who re- 
sponded to a subset of R and T items imbedded 
in another personality inventory. 

The R and T scale scores of the Group I sub- 
jects were summed and each item correlated with 


1 An extended report of this study may be obtained 
without charge from A. W. Bendig (Department of 
Psychology, University of Pittsburgh; Pittsburgh 13, 
Pennsylvania) or for a fee from the American Docu- 
mentation Institute. Order Document No. 6833 from 
ADI Auxiliary Publications Project, Photoduplica- 
tion Service, Library of Congress; Washington 25, 
D. C., remitting in advance $1.25 for microfilm or 
$1.25 for photocopies. Make checks payable to: 
Chief, Photoduplication Service, Library of Congress. 

2The author’s appreciation is extended to Gary 
Lotto and William B. Kehl of the university’s Com- 
putation and Data Processing Center for providing 
facilities for the statistical analysis. 


IMPROVING THE FACTORIAL PURITY OF GUILFORD’S 
RESTRAINT AND THOUGHTFULNESS SCALES 


A. W. BENDIG 
University of Pittsburgh 


this EI factor score. The 30 items correlating 
most highly were administered to Group III and 
these items factor analyzed. The 60 R and T 
items were divided into two subsets of 30 items 
(15 R and 15 T) and two factor analyses per- 
formed. The correlations between the R and T 
scores in both groups were .42, but the average 
(median) correlation between the two oblique 
factors was only .17. The 40 items with the most 
consistent and largest factor loadings were se- 
lected for analysis using the item responses of 
Group II. The correlation between the R and T 
factors was .21 and only one of the items had 
loadings inconsistent with the preceeding analy- 
ses. All four analyses showed certain items to be 
incorrectly keyed as to factor affiliation. 

The responses of the Group III subjects were 
scored for two pairs of R and T scores: the 
regular 30-item scales using the Guilford-Zim- 
merman keying and new 20-item scales using the 
item keying suggested by the first three factor 
analyses. The reliabilities of all four scores aver- 
aged about .71 (.69 to .73), but the correlation 
between the 30-item scales of .35 dropped to a 
nonsignificant .16 between the new 20-item scales. 

It was concluded that the moderate correlation 
between the regular R and T scales was due to 
the factorily impure item content of the scales 
and that there is only a small correlation between 
the R and T factors. It was also suggested that 
the EI second-order factor may be solely a psy- 
chometric artifact of these contaminated scales. 

The new 20-item scales were obtained by (a) 
omitting GZTS Items 2, 12, 17, 42, 47, 62, 92, 
107, 112, 122, 153, 163, 168, 178, 208, 213, 223, 
228, 243, and 288; (b) Scoring Items 173, 233, 
and 263 for factor R; and (c) Scoring Items 37, 
87, and 97 for factor T. The direction of scoring 
(true or false) is as given in the GZTS scale keys. 


(Received January 3, 1961) 
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An indispensable book for anyone concerned with the 
clinical process as it relates to the psychopathology of children 


CLINICAL PROCESS 


by E. KUNO BELLER 
Director of Research, Child Development Center, New York 


In this new publication, Dr. Beller proposes a system for organizing clinical data 
in the treatment of personality disorders in children and their parents. Derived 
from intensive research and a study of basic concepts, the book offers a series of 
guides for recording data in each phase of the clinical process. These guides 
serve as models for organizing and assessing clinical data so as to enhance the 
usefulness of clinical records in teaching, professional service, and research. 


The book, however, goes beyond the organization of clinical data and presents 
a quantitative mee for analyzing a evaluating clinical records in practice 
and research. Dr. Beller shows how this method can be used to assess the 
suitability of any set of clinical records for specific research studies. $10.00 
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Neurology and Psychiatry 
Section VIII of Excerpta Medica 


The Neurology Section includes abstracts of all articles on neurological surgery in ad- 
dition to general clinical neurology and experimental neurophysiology. 


The section on Psychiatry covers the topics usually contained within the meaning of 
the word. In addition the following special branches are dealt with fully: psychoanalysis, 
individual and analytical psychology; sexology, criminology, alcoholism, and drug addic- 
tion as related to psychiatry; psychosomatic medicine; social and industrial psychology 
and psychiatry, vocational guidance and mental testing; psychometrics; personality 
testing and the Rorschach test; heredity and statistical studies as they apply to psychi- 
atry. The subjects of mental defect and epilepsy are fully reported. 


Annual volume’ of 1400 pp., including an index classified by author and subject. 


Subscription price $27.00 per annual volume 


EXCERPTA MEDICA FOUNDATION 


The New York Academy of Medicine Building 
2 East 103 Street New York 29, N.Y. 
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WHAT 
DISTINGUISHES 


DEVEREUX 


in its service to children who need remedial edu- 
cation? It furnishes — 


1. Group living and learning experience with others 
of a similar aptitude and level of development. 


2. The functioning of a multidisciplinary team with 
long experience in evaluating potential and in structur- 
ing programs in a residential setting unique in its wide 
range of homogeneous groupings. 


3. A philosophy of optimum blending of traditional 
methods with the best of the new from the frontiers of 
research. 


4. Established programs of diagnosis, treatment, re- 
search, and training soundly based on the wide spec- 
frum of a multidisciplinary team of experts. 


Professional inquiries ore invited. East Coast residents, address Chories 
J. Fowler, Director of Admissions, Devereux Schools, Devon, Pennsylvanic. 
West? Coos? residents, address Keith A. Seaton, Registrar, Devereux Schools, 
Box 1079, Sonte Berbera, California. Southwustern residents, address 
John M. Barclay, Director of Development, Devereux Schools, Box 2269, 


Victorio, Texas. 
THE 
DEVEREUX | 
FO J N DATI Oo N COMMUNITIES 


A nonprofit organization CAMPS 
Founded 1912 TRAINING 


Devon, Pennsylvania RESEARCH; 
Sonto Borbara, California 


Victoria, Texas 


HELENA T. DEVEREUX EDWARD L. FRENCH, Ph.D. 
Administrative Consultant Diractor 
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PSYCHOLOGICAL TEST SPECIALISTS 


Exclusive publishers and distributors of selected psychological tests 


Memory-For-Designs Test (MFD) 
Simple drawing test of perceptual-motor coordination, serving as extremely sensitive detector of brain injury 
of many types. $8.50 for set of materials 
Group Personality Projective Test (GPPT) 


Multiple-choice questions about ambiguous stick-figures make possible rapid and effective measurement of 
significant, factor-analytically isolated aspects of personality. $13.50 for set of materials 


Id-Ego-Superego Test (IES) 

view. $28.50 for set of materials 

Kahn Test of Symbol Arrangement (KTSA) 
Uses symbolic plastic objects which are arranged and sorted; validly discriminates among normals, neurotics, 

brain-injured, and psychotics, and indicates accompanying personality dynamics. 

$25.00 for set of materials 

Full-Range Picture Vocabulary Test (FRPY) 
Brief test of intelligence in two forms using the same 16 plates. Words are defined by choosing pictures. 
Valuable for intelligence screening, and with the physically handicapped. $7.50 for set of materials 


Geist Picture Interest Inventory (GPI!) 


Provides for more valid expression of interests by choices among alternative pictured activities. Both 
qaalitative and quantitative interpretations are possible. $6.00 for set of materials 


Proverbs Test (PT) 


Group and individual forms determine similarity of thought processes to those of various normal and clinical 
groups. Highly sensitive indicator of psychotic processes. $4.50 for kit 


Vocational Apperception Test: Advanced Form (VAT :ADV) 
Separate series’ of plates showing male and female college-related occupationa! activities, accompanied by 
explanation of use and interpretation as a projective test. $7.25 for set of materials 


Famous Sayings (FS) 
Particularly useful in personnel selection. $4.50 for kit 


It Scale For Children (ITSC) 
An experimental picture scale for measuring sex role preference in children. $15.00 for set of materials 


Kahn Intelligence Tests: Experimental Form (KIT :EXP) 


intelligence tests requiring no reading or writing. Supplementary scales for assess- 
ment of intelligence of blind, ability in concept formation, recall, and motor coordination. One scale can 
be administered by sign language. $26.00 for set of materials 


PSYCHOLOGICAL TEST SPECIALISTS 
Box 1441 Missouta, Montana 
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